This is a data extraction based of online content using python(spider).
And there's no restriction for usage, just remember to cite :D
- ctext
- without en.csv lists all the parts without english translation.
- data_extraction_ctext.py is the original file to obtain all the data.
- the forlders list all the data from the website with both Chinese and English translation, stored with csv format.
- Jataka Stories
- data_extraction_jataka.py is the original file to obtain all the data.
- the csv files list all the data from the website with both Chinese/Sanskrit/Pali and English translation