bitprj · ryansxl · Mar 9, 2020 · Mar 9, 2020 · Mar 9, 2020 · Mar 18, 2020
diff --git a/Module_Twitter_API/labs/Twitter Hashtag Frequency/1.md b/Module_Twitter_API/labs/Twitter Hashtag Frequency/1.md
@@ -1,17 +1,16 @@
-<!--title={Introduction}-->
+<!--title={lab tool}-->
 
-## tweepy
+## lab tool
 
-For this lab we will utilize the skills you've gained working with APIs to visualize tweets using the **tweepy** Twitter API.
+In order to do this project, at first, we should create environment for this lab, and create your id and keys to use the tool.
 
-The idea is simple, given a topic, all hashtags with greater than 5% frequency pertaining to that topic are plotted in a pie graph. All hashtags with less than 5% frequency fall under an "Other" category.
+For this lab, we will use Python and the API of Tweet `tweepy`. 
 
-Hashtags provide an efficient way of deducing how tweeters feel about the topic they are tweeting about, since Twitter users use hashtags to summarize their tweets, often with more emotion. Therefore hashtags provide a sufficient summary of the tweet - there is a lesser need to process every character and word of a tweet if the hashtags are available. 
+In order to use the API, we need to get keys from the website of Twitter.
 
-By seeing the most common hashtags associated with a topic, we can evaluate what Twitter users are discussing under the scope of a greater topic and how people feel about the topic at hand. It's easy to get caught in our own echo chambers on social media, and analyzing the most common hashtags across *all* tweets for a certain topic helps us analyze the feelings behind a topic in a more objective manner. 
+#### Steps:
 
-Here is an example of what we will be aiming to accomplish at the end of this lab:
-
-![](https://projectbit.s3-us-west-1.amazonaws.com/darlene/labs/pieplot.png)
+1. Create environment.
+2. Authentification.
 
 
diff --git a/Module_Twitter_API/labs/Twitter Hashtag Frequency/11.md b/Module_Twitter_API/labs/Twitter Hashtag Frequency/11.md
@@ -0,0 +1,14 @@
+<!--title={Create Environment}--> 
+
+## Create Environment
+
+In order to do this project, at first, we should create environment for this lab.
+
+For this lab, we will use Python and the API of Tweet 'Tweepy'.
+
+We will use `pip install` to install all the packages. Be sure to distinguish  `pip3` and `pip`.
+
+#### Steps:
+
+1. Access `tweepy` API
+2. Load packages
diff --git a/Module_Twitter_API/labs/Twitter Hashtag Frequency/111.md b/Module_Twitter_API/labs/Twitter Hashtag Frequency/111.md
@@ -1,6 +1,14 @@
 <!--title={Accessing the Tweepy API}-->
 
-## Install tweepy and get keys
+## Access the Tweepy API
+
+In order to access the `tweepy` api, there are 2 steps we need to do:
+
+1. Install `tweepy` api.
+
+2. Get keys.
+
+#### Install tweepy
 
 If you're on Anaconda installin the Tweepy API is simple, just type the following: 
 
@@ -10,8 +18,7 @@ conda install tweepy
 
 After installing the API you will also have to create a developer account with Twitter in order to access the API, this process is quick and straightforward. Just click [this](https://developer.twitter.com/en/apply-for-access.html) link to get started.
 
-### Get keys
+#### Get keys
 
 In the previous step, when you register you can get important keys for your future exploration. You should clip on the button of "create app" and finish some questions.
 
-![](https://github.com/ryansxl/xshuai/blob/master/111.png?raw=true)
diff --git a/Module_Twitter_API/labs/Twitter Hashtag Frequency/112.md b/Module_Twitter_API/labs/Twitter Hashtag Frequency/112.md
@@ -1,6 +1,6 @@
 <!--title={Loading Packages}-->
 
-## Import packages
+## Load Packages
 
 The following packages will need to be installed in order to complete the necessary functions of the lab. By now you are already familiar with loading Python packages thanks to your previous labs.
 

diff --git a/Module_Twitter_API/labs/Twitter Hashtag Frequency/12.md b/Module_Twitter_API/labs/Twitter Hashtag Frequency/12.md
@@ -1,11 +1,13 @@
 <!--title={Authentification}-->
 
-## steps
+## Authentification
 
 In order to complete the various functions and methods we will perform, we need to login to twitter through our program. 
 
-To complete this we need to go through 3 simple steps:
+To complete this we need to go through 3 simple steps.
+
+#### Steps:
 
 1. Define your search keys
 2. Create the access token to login
-3. Finally, accessing the API
+3.  Access the API
diff --git a/Module_Twitter_API/labs/Twitter Hashtag Frequency/121.md b/Module_Twitter_API/labs/Twitter Hashtag Frequency/121.md
@@ -1,8 +1,8 @@
 <!--title={Defining Keys}--> 
 
-## Get the keys
+## Define Keys
 
-Defining the keys to login is simple, the information you need is all provided when you're developer account is created (The keys you get in card 111md). Type the following to store it in a variable:
+Defining the keys to login is simple, the information you need is all provided when your developer account is created (The keys you get in card 111md). Type the following to store it in a variable:
 
 ``` python
 consumer_key= 'yourkeyhere'

diff --git a/Module_Twitter_API/labs/Twitter Hashtag Frequency/122.md b/Module_Twitter_API/labs/Twitter Hashtag Frequency/122.md
@@ -1,6 +1,8 @@
 <!--title={Creating Access Token}-->
 
-Now we must create our access token, this is a key step to complete the login process.
+## Creating Access Token
+
+Now we must create our access token, which is a key step to complete the login process.
 
 First, store the token values in a variable:
 
@@ -9,7 +11,7 @@ access_token= 'yourkeyhere'
 access_token_secret= 'yourkeyhere'
 ```
 
-Second, use the OAuthHandler() and set_access_token() methods to create the instance that will allow login.
+Second, use the `OAuthHandler()` and `set_access_token()` methods to create the instance that will allow login.
 
 ``` python
 auth = tw.OAuthHandler(consumer_key, consumer_secret)

diff --git a/Module_Twitter_API/labs/Twitter Hashtag Frequency/123.md b/Module_Twitter_API/labs/Twitter Hashtag Frequency/123.md
@@ -1,8 +1,13 @@
 <!--title={Accessing the API}-->
 
-Now that we have created our access token we can finally access the API, this can be done in a simple line.
+## Accessing the API
+
+Now that we have created our access token we can finally access the API, which can be done in a simple line.
 
 ``` python
 api = tw.API(auth, wait_on_rate_limit=True)
 ```
 
+`auth` is reponsible for authenticating your access to the API via your keys
+
+`wait_on_rate_limit` will specify whether the function call will wait if you have called the API too many times (instead of quitting), since the API has a limited amount of times you can call the API
diff --git a/Module_Twitter_API/labs/Twitter Hashtag Frequency/2.md b/Module_Twitter_API/labs/Twitter Hashtag Frequency/2.md
@@ -1,7 +1,16 @@
 <!--title={Finding Tweets}-->
 
-## Get data about  "climate change"
+## Finding Tweets
 
 Now that we've authenticated we're ready to search for tweets. Let's start by searching for all tweets surrounding the topic of climate change. ("climate change" being your query string)
 
-![sample image](https://www.diggitmagazine.com/sites/default/files/styles/inline_image/public/Climate%20change%20photo_1.jpg?itok=2BfiKsqU)
+![](./images/earth.png)
+
+
+
+In this card, we will get the data of tweet with the topic of "climate change"
+
+#### Steps:
+
+1. Get data of tweet with the topic of "climate change".
+2. Store the data.
diff --git a/Module_Twitter_API/labs/Twitter Hashtag Frequency/21.md b/Module_Twitter_API/labs/Twitter Hashtag Frequency/21.md
@@ -1,6 +1,6 @@
 <!--title={The -filter method}-->
 
-## Get data about "climate change"
+## The -filter method
 
 In order to search for tweets under our desired hashtag, we will use the -filter method to find tweets under the climate change hashtag.
 

diff --git a/Module_Twitter_API/labs/Twitter Hashtag Frequency/22.md b/Module_Twitter_API/labs/Twitter Hashtag Frequency/22.md
@@ -1,5 +1,5 @@
 <!--title={Using Tweets}-->
 
-## Store the data
+## Using Tweets
 
 Now that we've found the recent tweets containg the hashtags that we will eventually analyze, we need to store the tweets in an organized manner for analysis.
diff --git a/Module_Twitter_API/labs/Twitter Hashtag Frequency/221.md b/Module_Twitter_API/labs/Twitter Hashtag Frequency/221.md
@@ -1,5 +1,7 @@
 <!--title={Grabbing 1000 Recent Tweets}-->
 
+## Grabbing 1000 Recent Tweets
+
 For our analysis we need an accurate sample size for credible findings. We will grab 1,000 tweets under the climate change hashtag for our analysis.
 
 To accomplish this we will use the Cursor method to iterate through the tweets, you may remember seeing this method from a previous lab.

diff --git a/Module_Twitter_API/labs/Twitter Hashtag Frequency/222.md b/Module_Twitter_API/labs/Twitter Hashtag Frequency/222.md
@@ -1,5 +1,7 @@
 <!--title={Adding Tweets to a List}-->
 
+## Adding Tweets to a List
+
 Now we can use list comprehension to iterate through our recently found items in a list. 
 
 ``` python

diff --git a/Module_Twitter_API/labs/Twitter Hashtag Frequency/3.md b/Module_Twitter_API/labs/Twitter Hashtag Frequency/3.md
@@ -1,7 +1,7 @@
 <!--title={Cleaning Tweets}--> 
 
-## Deal with Data
+## Cleaning Tweets
 
-As you saw from the output of our lists there are links to the tweets. While this may be nice to track the source of the tweets it will be a hinderance when parsing through the list for analysis.
+The output of our lists are links to the tweets. Although this may be nice to track the source of the tweets, it will be a hinderance when parsing through the list for analysis.
 
-We will use regular expressions to accomplish the data cleaning. Throughout the previous labs you have gone through you may by now know that cleaning data is the longest portion of analysis projects.
+We will use regular expressions to accomplish the data cleaning. Throughout the previous labs you have gone through, you now know that cleaning data is the first portion of analysis projects.
diff --git a/Module_Twitter_API/labs/Twitter Hashtag Frequency/311.md b/Module_Twitter_API/labs/Twitter Hashtag Frequency/311.md
@@ -1,6 +1,6 @@
 <!--title={Using Regular Expressions}-->
 
-## Module re
+## Using Regular Expressions
 
 You may remember seeing ```import re``` while we were loading our packages earlier. Re stands for ```regular expressions```. Regular expressions are a special syntax that is used to identify patterns in a string.
 

diff --git a/Module_Twitter_API/labs/Twitter Hashtag Frequency/312.md b/Module_Twitter_API/labs/Twitter Hashtag Frequency/312.md
@@ -1,6 +1,6 @@
 <!--title={re.sub method}-->
 
-## re.sub
+## re.sub method
 
 `re.sub` allows you to substitute a selection of characters defined using a regular expression, with something else.
 

diff --git a/Module_Twitter_API/labs/Twitter Hashtag Frequency/313.md b/Module_Twitter_API/labs/Twitter Hashtag Frequency/313.md
@@ -1,6 +1,6 @@
 <!--title={Creating a remove_url function}-->
 
-## Create a "remove" function
+## Creating a remove_url function
 
 Using the re.sub method we just looked at we can create a function that removes urls from the items of our list.
 

diff --git a/Module_Twitter_API/labs/Twitter Hashtag Frequency/32.md b/Module_Twitter_API/labs/Twitter Hashtag Frequency/32.md
@@ -1,6 +1,6 @@
 <!--title={Creating a List of Clean Tweets}-->
 
-## Delete URL in data
+## Creating a List of Clean Tweets
 
 Now that we have finished removing urls from our tweets we can add them to a list for analysis. 
 

diff --git a/Module_Twitter_API/labs/Twitter Hashtag Frequency/33.md b/Module_Twitter_API/labs/Twitter Hashtag Frequency/33.md
@@ -1,6 +1,6 @@
 <!--title={Addressing Case Issues}-->
 
-## Deal with list
+## Addressing Case Issues
 
 Another challenge we will address is capitalization which becomes a challenge with data analysis for text data. 
 

diff --git a/Module_Twitter_API/labs/Twitter Hashtag Frequency/331.md b/Module_Twitter_API/labs/Twitter Hashtag Frequency/331.md
@@ -1,6 +1,6 @@
 <!--title={.lower() Method}-->
 
-## Make word lowercase
+## .lower() Method
 
 To begin to remedy this issue we can  make each word lowercase using the string method `.lower()`. In the code below, this method is applied using a list comprehension.
 

diff --git a/Module_Twitter_API/labs/Twitter Hashtag Frequency/332.md b/Module_Twitter_API/labs/Twitter Hashtag Frequency/332.md
@@ -1,6 +1,6 @@
 <!--title={set() Method}--> 
 
-## Unique list
+## set() Method}
 
 Now all of the words in your list are lowercase. You can again use `set()` function to return only unique words.
 

diff --git a/Module_Twitter_API/labs/Twitter Hashtag Frequency/333.md b/Module_Twitter_API/labs/Twitter Hashtag Frequency/333.md
@@ -1,6 +1,6 @@
 <!--title={Creating a List of Lower Case Words from Tweets}-->
 
-## Deal with list
+## Creating a List of Lower Case Words from Tweets
 
 Right now, you have a list of lists that contains each full tweet and you know how to lowercase the words.
 

diff --git a/Module_Twitter_API/labs/Twitter Hashtag Frequency/4.md b/Module_Twitter_API/labs/Twitter Hashtag Frequency/4.md
@@ -1,6 +1,8 @@
 <!--title={Calculating Hashtag Frequency}-->
 
-Now we will incorporate some elementary math to enable us to display the frequencies of each hashtag and plot it as you will see later.
+## Calculating Hashtag Frequency
 
-To get the count of how many times each word appears in the sample, you can use the built-in `Python` library `collections`, which helps create a special type of a `Python dictionary.`
+Now we will incorporate some elementary math methods which can enable us to display the frequencies of each hashtag and plot it as you will see later.
+
+To get the count of how many times each word appears in the sample, you can use the built-in `Python` library `collections`, which helps us create a special type of a `Python dictionary.`
 
diff --git a/Module_Twitter_API/labs/Twitter Hashtag Frequency/5.md b/Module_Twitter_API/labs/Twitter Hashtag Frequency/5.md
@@ -1,6 +1,6 @@
 <!--title={Plotting Hashtag Frequency}-->
 
-## Visualize the data
+## Plot Hashtag Frequency
 
 Now that we have cleaned the data (seemingly) we can plot it to show our findings!
 

diff --git a/Module_Twitter_API/labs/Twitter Hashtag Frequency/51.md b/Module_Twitter_API/labs/Twitter Hashtag Frequency/51.md
@@ -1,5 +1,7 @@
 <!--title={Using the pd.DataFrame}-->
 
+## Using the pd.DataFrame
+
 Based on the counter, you can create a `Pandas Dataframe` for analysis and plotting that includes only the top 15 most common words.
 
 ``` python

diff --git a/Module_Twitter_API/labs/Twitter Hashtag Frequency/52.md b/Module_Twitter_API/labs/Twitter Hashtag Frequency/52.md
@@ -1,5 +1,7 @@
 <!--title={Creating a Visualization}-->
 
+## Creating a Visualization
+
 Using this `Pandas Dataframe`, you can create a horizontal bar graph of the top 15 most common words in the tweets as shown below.
 
 ```python
@@ -20,5 +22,5 @@ These are simple commands and paramters that we have encountered before. The plo
 
 With that, we are now done! Below is the output of the common words found in our Tweets.
 
-![Imgur](https://i.imgur.com/GloG9zm.png)
+![Imgur](./images/result.png)
 
diff --git a/Module_Twitter_API/labs/Twitter Hashtag Frequency/images/earth.png b/Module_Twitter_API/labs/Twitter Hashtag Frequency/images/earth.png
diff --git a/Module_Twitter_API/labs/Twitter Hashtag Frequency/images/pieplot.png b/Module_Twitter_API/labs/Twitter Hashtag Frequency/images/pieplot.png
diff --git a/Module_Twitter_API/labs/Twitter Hashtag Frequency/images/result.png b/Module_Twitter_API/labs/Twitter Hashtag Frequency/images/result.png
diff --git a/Module_Twitter_API/labs/Twitter Hashtag Frequency/readme.md b/Module_Twitter_API/labs/Twitter Hashtag Frequency/readme.md
@@ -0,0 +1,30 @@
+<!--title={Introduction}-->
+
+For this lab we will utilize the skills you've gained working with APIs to visualize tweets using the **tweepy** Twitter API.
+
+The idea is simple, given a topic, all hashtags with greater than 5% frequency pertaining to that topic are plotted in a pie graph. All hashtags with less than 5% frequency fall under an "Other" category.
+
+Hashtags provide an efficient way of deducing how tweeters feel about the topic they are tweeting about, since Twitter users use hashtags to summarize their tweets, often with more emotion. Therefore hashtags provide a sufficient summary of the tweet - there is a lesser need to process every character and word of a tweet if the hashtags are available. 
+
+By seeing the most common hashtags associated with a topic, we can evaluate what Twitter users are discussing under the scope of a greater topic and how people feel about the topic at hand. It's easy to get caught in our own echo chambers on social media, and analyzing the most common hashtags across *all* tweets for a certain topic helps us analyze the feelings behind a topic in a more objective manner. 
+
+In the end of this card, we will visualization the frequency of hashtags in tweets. To achieve this goal, there are serveral steps to do.
+
+#### Steps:
+
+1. Install lab tool
+
+2. Find tweets
+
+3. Clean tweet data
+
+4. Calculate hashtags frequency
+
+5. plot hashtags frequency
+
+   #### Examples:
+
+Here is an example of what we will be aiming to accomplish at the end of this lab:
+
+![](./images/pieplot.png)
+