Skip to content

ColinLeverger/beer-scraper-scala

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Beer Scraper in Scala

Simple scrapper, developed using scala 2.12.1, ruippeixotog/scala-scraper and scalatest.

Use of FlatSpecs from scalatest to work in TDD.

Scrapping the website http://craftcans.com/db.php?search=all&sort=beerid&ord=desc&view=text.

Followed the tutorial http://blog.kaggle.com/2017/01/31/scraping-for-craft-beers-a-dataset-creation-tutorial/ to clean the data (work in progress)

How does it work?

Run it yourself!

git clone https://github.com/ColinLeverger/beer-scraper-scala`
sbt test
sbt run

What about the process?

  1. Connect to the website and download the HTML
  2. Parse it with the library scala-scraper
  3. Create a list of Beers case class objects
  4. Write CSV
  5. Done!

Sample output

See this file. Variables cleaned.

About

Beer Scraper in Scala

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages