Skip to content

haraldur12/Charles-Dickens-NLTK

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Charles-Dickens-NLTK

################

This is the final project for Research Tools and Techniques class which aims to determine whether Charles Dickens' was able to improve his vocabulary as he kept writing.

################

In the frequencies folder the word lists of the each novel can be found under the same name of the novel. The files present in the folder gives us a glimpse of how many words he had used and how often he used them.

################

In the charts folder the visualization of the most frequent 100 words from each novel can be seen.

###############

The corpora consist of five novels published at different times to see the flow of improvement in the author's vocabulary.

##############

Publication dates can be found the dates_of_publication.txt

#############

About

A project that I had for my NLTK practises

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages