Attention Mechanism in Sequence-to-Sequence learning and Pointer Neural Nets to extract the attributes of gun violence events from news reports.

srjit/pointer-seq2seq-gun-violence-event-analysis

Gun violence event analysis from a newspaper corpus using the Attention Mechanism in Sequence-to-Sequence Networks & Pointer Networks

Folders

  • Corpus: Collects news from Google News via an RSS feed, filters it by keywords, and adds it to a Postgres database
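
The keyword filtering step can be sketched roughly as below. This is a minimal illustration, not the code in the Corpus folder: the function names, the `title` and `link` keys, and the sample keywords are all assumptions made for the example.

```python
def matching_keywords(text, keywords):
    """Return the keywords that occur in text, compared case-insensitively."""
    lowered = text.lower()
    return [kw for kw in keywords if kw.lower() in lowered]


def filter_entries(entries, keywords):
    """Keep only entries whose title mentions at least one keyword.

    Each entry is assumed to be a dict with "title" and "link" keys
    (hypothetical field names, not taken from the repository).
    """
    kept = []
    for entry in entries:
        hits = matching_keywords(entry["title"], keywords)
        if hits:
            # Record which keywords matched alongside the original fields.
            kept.append({**entry, "keywords": ",".join(hits)})
    return kept


if __name__ == "__main__":
    sample = [
        {"title": "Police investigate shooting downtown", "link": "https://example.com/a"},
        {"title": "Local bakery wins award", "link": "https://example.com/b"},
    ]
    for row in filter_entries(sample, ["shooting", "gunman"]):
        print(row["link"], row["keywords"])
```

Substring matching is the simplest choice here; it will also match keywords embedded in longer words, so a stricter word-boundary match may be preferable in practice.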

Steps - Preprocessing

  1. Postgres
    • Create a database named news in Postgres
    • Run: CREATE TABLE gnews_search_results(scrapetime timestamp, rss_id varchar(20), queryinfo varchar(30), url varchar(100), keywords varchar(100), entry varchar(400));
    • Run: CREATE TABLE page_downloads(url text, downloadtime timestamp, status_code text, http_header text, page_content text, source text);
  2. Data Collection
    • Run gnews_scrape.py (collects Google News results)
    • Run corpus/consolidated_page_downloader.py (downloads the HTML pages)
  3. Text Extraction
    • Use the Lynx browser to extract text from the collected web pages
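
The text extraction step could be scripted along the following lines. This is a hedged sketch that assumes Lynx is installed and on the PATH and that the HTML is available as a string (for example, a page_content value read from page_downloads). The helper names `lynx_command` and `extract_text` are hypothetical and do not come from the repository.

```python
import shutil
import subprocess


def lynx_command():
    """Build a Lynx invocation that reads HTML on stdin and prints plain text."""
    # -stdin: read the document from standard input
    # -dump: write the rendered page to stdout and exit
    # -nolist: omit the numbered list of links at the end of the dump
    # -force_html: treat the input as HTML regardless of its source
    return ["lynx", "-stdin", "-dump", "-nolist", "-force_html"]


def extract_text(html):
    """Render an HTML string to plain text with Lynx (assumed to be installed)."""
    if shutil.which("lynx") is None:
        raise RuntimeError("lynx is not installed or not on PATH")
    result = subprocess.run(
        lynx_command(),
        input=html,
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if Lynx exits with an error
    )
    return result.stdout


if __name__ == "__main__":
    print(extract_text("<html><body><h1>Headline</h1><p>Body text.</p></body></html>"))
```

Piping the HTML through stdin avoids writing temporary files; pages with unusual character encodings may need extra Lynx options that are not covered here.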
