Skip to content

Commit 5e43e1f

Browse files
committed
data update
1 parent 5c7c448 commit 5e43e1f

6 files changed

+8422
-5
lines changed

data/README.md

+15-5
Original file line numberDiff line numberDiff line change
@@ -2,17 +2,27 @@
22

33
We will use freely available historical data from market, fundamental and alternative sources. Chapter 2, Market and Fundamental Data and Chapter 3, Alternative Data for Finance cover characteristics and access to these data sources and introduce key providers that we will use throughout the book.
44

5-
A few sample data sources that we will source and work with include, but are not limited to:
5+
A few sample data sources that we will source and work with include, among others:
6+
- Quandl daily prices and other data points for over 3,000 US stocks
7+
- Algoseek minute bar trade and quote price data for NASDAQ 100 stocks
8+
- Stooq daily price data on Japanese equities and US ETFs and stocks
9+
- Yahoo finance daily price data and fundamentals for US stocks
610
- NASDAQ ITCH order book data
711
- Electronic Data Gathering, Analysis, and Retrieval (EDGAR) SEC filings
812
- Earnings call transcripts from Seeking Alpha
9-
- Quandl daily prices and other data points for over 3,000 US stocks
1013
- Various macro fundamental data from the Federal Reserve and others
11-
- Large Yelp business reviews and Twitter datasets
14+
- Financial news data from Reuters, etc.
15+
- Twitter sentiment data
16+
- Yelp business reviews sentiment data
1217

1318
## How to source the Data
1419

15-
The notebook [create_datasets](create_datasets.ipynb) contains information on downloading the Quandl Wiki stock prices and a few other sources that we use throughout the book.
20+
There are several notebooks that guide you through the data sourcing process:
21+
- The notebook [create_datasets](create_datasets.ipynb) contains information on downloading the **Quandl Wiki stock prices** and a few other sources that we use throughout the book, such as S&P500 benchmark, and US equities metadata.
22+
- The notebook [create_stooq_data](create_stooq_data.ipynb) demonstrates how to download historical prices for Japanese stocks and US stocks and ETFs from STOOQ.
23+
- The notebook [create_yelp_review_data](create_yelp_review_data.ipynb) combines text data with additional numerical features for sentiment analysis from Yelp user reviews.
24+
- The notebook [glove_word_vectors](glove_word_vectors.ipynb) downloads pre-trained word vectors.
25+
- The notebook [twitter_sentiment](twitter_sentiment.ipynb) downloads and extracts twitter data for sentiment analysis.
1626

17-
Instructions to obtain data sources for specific applications are provided in the relevant directories and notebooks of this repository.
27+
In addition, instructions to obtain data sources for specific applications are provided in the relevant directories and notebooks.
1828

0 commit comments

Comments
 (0)