Wikipedia talk:Database download
- Please note that questions about the database download are more likely to be answered on the xmldatadumps-l or wikitech-l mailing lists than on this talk page.
Is there a way to get diff update periodically?
>> pages-articles.xml.bz2 – Current revisions only, no talk or user pages; this is probably what you want, and is approximately 11 GB compressed (expands to over 49 GB when uncompressed).
If I want to cache whole Wikipedia locally, I can download the whole 11 GB. But What if I want to update periodically once a day or once a week? Do I have to download again whole 11 GB or is there a better way?
Frustrating and NOT fair
This is frustrating: I'm looking for a link to download wikipedia but nothing. These pages are too technical for "normal" people to understand. Even then, the pages talk about xlm and do not provide a context, etc. There's still too long to go for these resources to become democratic. BTW, where's all money we donate going? can some go to make the terminology more friendly to normal people? — Preceding unsigned comment added by 22.214.171.124 (talk) 22:33, 17 April 2017 (UTC)
- These pages are not intended for users without technical knowledge. Reader programs/apps should deal with this in the background. Some things should be more clear though, especially multistream - I don't understand why wikimedia is even still offering the non-multistream archive. I mean, if it's for the savings, offer a 7z instead and you can knock another 20% off. Neither do I understand why the hell this page is recommending you get the non-multistream archive. W3ird N3rd (talk) 03:09, 14 July 2017 (UTC)
I have downloaded the pagelinks table, on disk it's about 32GB. Does anyone know of how big it is once imported into a database? My import is currently greater than 50GB and still going.
File system limits
A file is data - operating systems can and do split that data and have it stored upon many parts of a medium (fragmentation). Everyone knows that a non-fragmented data file is limited by size. — Preceding unsigned comment added by 126.96.36.199 (talk) 07:41, 28 September 2016 (UTC)