ta.TwitterDB.loadWordsData

TwitterDB.loadWordsData(inc)[source]

Method to load the tweet words in a separate collection in mongoDB. It creates the tweetWords collection.

Parameters

inc – used to determine how many tweets will be processed at a time. A large number may cause out of memory errors, and a low number may take a long time to run, so the decision of what number to use should be made based on the hardware specification. the string to clean

Examples

>>> loadWordsData(50000)