This repository contains final dataset from https://github.com/xmvlad/nlp_wiki_topic project. Clone this repository and unpack wiki_topic_dataset.tar.gz
In this project dataset from Wikipedia was collected, it contains top level topic names from Wikipedia articles and related text.