The following dumps are avalible:
special | wikibooks | wikinews | wikipedia | wikiquote | wiktionary | images
At the root of each Wikimedia project dump, you will find a listing of languages and the md5 sum for all files available in the tree. Each language directory contains these XML dumps:
pages_current.xml.gz – Link to the lastest current pages dump
all_titles_in_ns0.gz – All titles in the main namespace
pages_full.xml.gz – Link to the lastest full pages dump
Now what could a search engine spammer possiably do with these dumps and the markov chain?
Well, with the language feed and markov, you can now spam the search engines in tongues you don’t even speak. With markov, keep in mind that you need about 300+% imput to output ratio for optimal articles.

RSS Feed
Twitter
November 23rd, 2005
QuadsZilla
Posted in 

holy shit, thats alot of data.
Great news!
I’m about to cry… This will be EXCELLENT with my 5,563,359 row database of world cities and towns in both english and native character sets, UTF, latitude, longitude, etc. Thank you wikipedia and SEO BlackHat, THANK YOU!!! Terabyte site hosting anyone???
this so great!!