|
2 | 2 |
|
3 | 3 | We will use freely available historical data from market, fundamental and alternative sources. Chapter 2, Market and Fundamental Data and Chapter 3, Alternative Data for Finance cover characteristics and access to these data sources and introduce key providers that we will use throughout the book.
|
4 | 4 |
|
5 |
| -A few sample data sources that we will source and work with include, but are not limited to: |
| 5 | +A few sample data sources that we will source and work with include, among others: |
| 6 | +- Quandl daily prices and other data points for over 3,000 US stocks |
| 7 | +- Algoseek minute bar trade and quote price data for NASDAQ 100 stocks |
| 8 | +- Stooq daily price data on Japanese equities and US ETFs and stocks |
| 9 | +- Yahoo finance daily price data and fundamentals for US stocks |
6 | 10 | - NASDAQ ITCH order book data
|
7 | 11 | - Electronic Data Gathering, Analysis, and Retrieval (EDGAR) SEC filings
|
8 | 12 | - Earnings call transcripts from Seeking Alpha
|
9 |
| -- Quandl daily prices and other data points for over 3,000 US stocks |
10 | 13 | - Various macro fundamental data from the Federal Reserve and others
|
11 |
| -- Large Yelp business reviews and Twitter datasets |
| 14 | +- Financial news data from Reuters, etc. |
| 15 | +- Twitter sentiment data |
| 16 | +- Yelp business reviews sentiment data |
12 | 17 |
|
13 | 18 | ## How to source the Data
|
14 | 19 |
|
15 |
| -The notebook [create_datasets](create_datasets.ipynb) contains information on downloading the Quandl Wiki stock prices and a few other sources that we use throughout the book. |
| 20 | +There are several notebooks that guide you through the data sourcing process: |
| 21 | +- The notebook [create_datasets](create_datasets.ipynb) contains information on downloading the **Quandl Wiki stock prices** and a few other sources that we use throughout the book, such as S&P500 benchmark, and US equities metadata. |
| 22 | +- The notebook [create_stooq_data](create_stooq_data.ipynb) demonstrates how to download historical prices for Japanese stocks and US stocks and ETFs from STOOQ. |
| 23 | +- The notebook [create_yelp_review_data](create_yelp_review_data.ipynb) combines text data with additional numerical features for sentiment analysis from Yelp user reviews. |
| 24 | +- The notebook [glove_word_vectors](glove_word_vectors.ipynb) downloads pre-trained word vectors. |
| 25 | +- The notebook [twitter_sentiment](twitter_sentiment.ipynb) downloads and extracts twitter data for sentiment analysis. |
16 | 26 |
|
17 |
| -Instructions to obtain data sources for specific applications are provided in the relevant directories and notebooks of this repository. |
| 27 | +In addition, instructions to obtain data sources for specific applications are provided in the relevant directories and notebooks. |
18 | 28 |
|
0 commit comments