{"id":114,"date":"2016-08-18T22:02:44","date_gmt":"2016-08-18T16:32:44","guid":{"rendered":"http:\/\/madhurendra.com\/?p=114"},"modified":"2016-08-18T22:02:44","modified_gmt":"2016-08-18T16:32:44","slug":"offline-wikipedia-elasticsearch-mediawiki","status":"publish","type":"post","link":"https:\/\/madhurendra.com\/offline-wikipedia-elasticsearch-mediawiki\/","title":{"rendered":"Offline Wikipedia with Elasticsearch and MediaWiki"},"content":{"rendered":"

Wikipedia is Awesome ! It’s open, its free – Yea. & its huge in size, millions of articles\u00a0<\/a>\u00a0but as developer how to exploit the free knowledge.<\/p>\n

I started digging internet just to find ways to exploit my fresh 9+ GB of XML Gzipped archive which seemed to me of no use as even a simple text editor can’t open it. (Just out of excitement what’s inside, how its structured, Schema ! )<\/p>\n

Luckily people have already imported it. Elasticsearch is fast, reliable & its good for searching, so\u00a0https:\/\/github.com\/andrewvc\/wikiparse<\/a> was a saver.<\/p>\n