literallyp1

A Twitter bot that scrapes and tweets top stories from certain newspaper front pages

It grabs top stories from newspaper websites that have their print front page available online and tweets the top story's headline, byline and link, along with a .png image of the headline, byline and deck of up to the next five page-one stories.
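As a rough illustration of the tweet text described above, here is a minimal sketch of joining a headline, byline and link into one tweet. The function name `compose_tweet` and its layout are assumptions for illustration, not code taken from literallyp1.py. It relies on Twitter's 280-character limit and on Twitter counting any link as 23 characters, and it uses plain `len()` as a simplification of Twitter's weighted character counting.

```python
TWEET_LIMIT = 280  # Twitter's character limit for a tweet
URL_LENGTH = 23    # Twitter counts every link as 23 characters


def compose_tweet(headline, byline, link):
    """Return 'headline / byline / link' on separate lines.

    Hypothetical helper: trims the headline (never the byline or link)
    so the whole tweet stays within TWEET_LIMIT.
    """
    # Characters already spoken for: byline, link weight, two newlines.
    fixed = len(byline) + URL_LENGTH + 2
    room = TWEET_LIMIT - fixed
    if len(headline) > room:
        # Leave space for a three-character ASCII ellipsis.
        headline = headline[: room - 3].rstrip() + "..."
    return f"{headline}\n{byline}\n{link}"


if __name__ == "__main__":
    print(compose_tweet(
        "Example Headline for the Top Story",
        "By A. Reporter",
        "https://www.nytimes.com/",
    ))
```

Trimming only the headline keeps the byline and link intact, which are the parts a reader needs to find the story.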

Right now, it's only for The New York Times, and it's tweeted out here.

This project was born from a conversation with an old journalist friend about news cycles, social media and feeds. You don't have to be a newspaper person to know that Page One is a big deal; plenty of movie tropes show reporters scrambling to get their story on the front, and a roomful of editors arguing over placement. For the outlets that take it seriously, it's a big decision to distill the day's events into the most important that will fit on one page.

Of course, these headlines are still out there in the different feeds. But the labor and human decision-making that go into choosing the day's top stories are even more obscured, simply because most people aren't looking. This bot aims to tweet the top story, with an image of the next several stories' details.

Usage

Get some keys and access tokens from Twitter, and hide them in a good spot. I put mine in a gitignored file called config.py and import the variables.
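One possible way to handle the keys is sketched below. It tries to import a gitignored config module and falls back to environment variables, which is how cloud services such as Heroku usually supply secrets. The variable names (`CONSUMER_KEY` and so on) and the `load_credentials` helper are assumptions for illustration; the names in your own config.py may differ.

```python
import importlib
import os

# Hypothetical variable names; match them to whatever config.py defines.
REQUIRED = (
    "CONSUMER_KEY",
    "CONSUMER_SECRET",
    "ACCESS_TOKEN",
    "ACCESS_TOKEN_SECRET",
)


def load_credentials(module_name="config", environ=None):
    """Return the four Twitter credentials as a dict.

    Reads them from the named module if it can be imported,
    otherwise from environment variables of the same names.
    """
    environ = os.environ if environ is None else environ
    try:
        module = importlib.import_module(module_name)
        found = {name: getattr(module, name, None) for name in REQUIRED}
    except ImportError:
        found = {name: environ.get(name) for name in REQUIRED}

    missing = [name for name in REQUIRED if not found[name]]
    if missing:
        # Fail early rather than sending a half-authenticated request.
        raise RuntimeError("Missing Twitter credentials: " + ", ".join(missing))
    return found
```

The returned values can then be handed to whatever authentication call your Twitter library expects.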

Currently, running python3 literallyp1.py will tweet the top six stories from the front page of The New York Times. More newspapers to come (TK, for the news nerds); see the issues and pitch in if you can.

To send tweets regularly without manually running that command, set up a cron job. I don't know when the Times updates its "Today's Paper" webpage, but 4am EST is a good bet. Up to you if you want to do this from an always-on-and-connected computer, or from a cloud service like Heroku.
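Cron runs on the server's clock, and many cloud hosts keep that clock on UTC, so the 4am EST guess needs converting. The sketch below builds a daily crontab line for a given EST hour, assuming the server runs on UTC. The script path is a placeholder, and `crontab_line` is a hypothetical helper, not part of this package. EST is treated as a fixed UTC-5 offset, so daylight saving time is not handled.

```python
from datetime import datetime, timedelta, timezone

# Eastern Standard Time as a fixed offset (no daylight saving handling).
EST = timezone(timedelta(hours=-5), "EST")


def crontab_line(hour_est, command, minute=0):
    """Return a daily crontab entry for the given EST time, expressed in UTC."""
    # The calendar date is arbitrary; only the hour and minute matter.
    local = datetime(2024, 1, 15, hour_est, minute, tzinfo=EST)
    utc = local.astimezone(timezone.utc)
    return f"{utc.minute} {utc.hour} * * * {command}"


if __name__ == "__main__":
    # Placeholder path; point it at your own checkout.
    print(crontab_line(4, "python3 /path/to/literallyp1.py"))
    # prints: 0 9 * * * python3 /path/to/literallyp1.py
```

Paste the printed line into `crontab -e` on the machine that will send the tweets. If that machine's clock is set to Eastern time instead of UTC, use hour 4 directly.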

For a good how-to on getting Twitter keys and setting this up on Heroku, see this post by Brian Caffey.

Molly White has another good blog post on setting up a Twitter bot that also explains cron jobs.

Installation

Clone the repo and run pip install -r requirements.txt.

Requirements

This package needs requests, BeautifulSoup and lxml to get the story details; Pillow to make the tweet image; and tweepy to do the tweeting.

Authors

literallyp1 was written by Rick Valenzuela.

