Crawler_and_sentiment_analysis
In settings.py, set the variable that closes the spider once this many items have been scraped:
CLOSESPIDER_ITEMCOUNT = 10
cd into the spiders folder, then run the crawler from the command line (the -o option also saves the scraped items to items.json):
scrapy crawl quotes
scrapy crawl quotes -o items.json
Return to the parent folder with cd .., then run the indexing and query scripts (Concordia is the example query term):
python inv_index.py
python query_processing.py Concordia
The result of the query is saved in the text file Result_01.txt.
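The steps above could be chained in a small Python helper. This is only a sketch: the functions build_steps and run_steps are hypothetical names that are not part of this repository, and the path to the spiders folder is a placeholder you would need to adjust. It assumes scrapy and python are both on the PATH.

```python
import subprocess
from pathlib import Path


def build_steps(spiders_dir, query):
    """Return (command, working_directory) pairs in the order described above.

    Hypothetical helper: spiders_dir is wherever the spiders folder lives.
    """
    spiders_dir = Path(spiders_dir)
    parent_dir = spiders_dir.parent  # the folder reached with `cd ..`
    return [
        # Crawl and save the scraped items to items.json.
        (["scrapy", "crawl", "quotes", "-o", "items.json"], spiders_dir),
        # Run the indexing script from the parent folder.
        (["python", "inv_index.py"], parent_dir),
        # Run the query script with the given query term.
        (["python", "query_processing.py", query], parent_dir),
    ]


def run_steps(steps):
    """Run each command in its directory, stopping at the first failure."""
    for command, cwd in steps:
        subprocess.run(command, cwd=cwd, check=True)


# Example call (placeholder path, adjust before use):
# run_steps(build_steps("path/to/spiders", "Concordia"))
```

Using check=True makes the helper stop as soon as one step fails, so the query is never run against a missing or stale index.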
dataset.txt
-- [general information about all the files analysed]
infoForQueries.txt
-- [url, length in words of the page at this url, sentiment value for this url]
sentiment_for_each_url.txt
-- [url, sentiment value for this url]
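As a rough illustration of how the per-url records could be read back, here is a minimal sketch. The on-disk layout is not documented here, so the code assumes one record per line with whitespace-separated fields in the order url, length, sentiment; that layout, the UrlInfo type, and the function names are all assumptions made for this example, and the url in the usage note is made up.

```python
from typing import NamedTuple


class UrlInfo(NamedTuple):
    """One record: url, length in words, sentiment value (hypothetical type)."""
    url: str
    length_in_words: int
    sentiment: float


def parse_info_line(line):
    """Parse a single line, assuming whitespace-separated fields."""
    url, length, sentiment = line.split()
    return UrlInfo(url, int(length), float(sentiment))


def load_info(path):
    """Read every non-empty line of the file into a list of UrlInfo records."""
    with open(path, encoding="utf-8") as handle:
        return [parse_info_line(line) for line in handle if line.strip()]
```

For example, parse_info_line("http://example.com/page 120 0.5") would give a record with length_in_words equal to 120 and sentiment equal to 0.5. If the real files use a different separator, only the split call needs to change.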