Introduction

The project contains a task to pull data off xkcd and stores into MySQL database.

Python Setup

Create virtualenv

virtualenv venv
# virtualenv -p=python3 venv  
# (If `Py3` is not the default version)

Activate virtualenv

source venv/bin/activate

Install dependencies (using virtual environment is recommended):

pip install -r requirements.txt

Database setup instructions

To setup, database execute sql script database_.sql (contains DDL) in MySQL query console.
You should create a file as commons/settings/secrets.yaml to contain database credentials. Example -

# ---LOCAL---
LOCALSQL_USER: root
LOCALSQL_HOST: 127.0.0.1
LOCALSQL_PORT: 3306
LOCALSQL_PASSWORD: xxxxx
LOCALSQL_DATABASE: xkcdDB

mysql database installation -

https://tecadmin.net/install-mysql-8-centos-8/

Note : This project has been tested against mysql-8.0.21 using RHEL8.4. We recommend yum installation for RPM-based software installations. More instructions for installing MySQL on Linux.

NOTE: pymysql installation and related versions of mysql should be matched.

Usage

Activate virtual environment and start app with python command. Example -

source venv/bin/activate
python task_one.py

Alternatively, you can also pass command line arguments to specify max comic ids (--max) and set of random size (--any).

# example (full options)
python task_one.py --max 87 --any 15

# OR  (short options)
python task_one.py -m 87 -a 15

Good luck!

Running tests

Run following command for all the test cases

pytest

Development notes

The database modeling for the xkcd API goes like follows: 
Name of models -

# comics    
 
num: INT(11) NOT NULL PRIMARY KEY
month: VARCHAR(250)
link: VARCHAR(250)
year: VARCHAR(250)
news: VARCHAR(250)
safe_title: VARCHAR(250)
transcript: VARCHAR(5000)
alt: VARCHAR(5000)
img: VARCHAR(250)
title: VARCHAR(250)
day: VARCHAR(250)

NOTE : Readers are requested to raise PRs with `problems identified` with the script.

coding style

Since the code has been written in Python3.x, function annotations and type-hinting has been
used across.
Google-Styled docstrings have been used to describe functions/classes/modules.
Pydantic data-classes have been used to validate the responses from xkcd API endpoints.
Set your IDE character limit per line to maximum 100 (recommendation)
Set your configurations via common/settings/secrets.yaml; Do NOT commit file containing secrets.
The generic functionality has been maintained under commons.
An independent data access layer (commons/dal.py) has been created to isolate data retrieval and storage functionality.

Task 1

GET 15 random comics and following details in using Python.
- Get the names of the comic
- Get the alt-text of the comic
- Get the number of the comic
- Get the link of the comic
- Get the image of the comics
- Get the image Link of the comics
Insert into MySQL - Please include SQL database schema(s) for any table(s) created in your Github repo

OUTPUT OF TASK 1 (as of timestamp: '2021-06-09 18:31:20')

[
  {
    "comic": "Staceys Dad",
    "comic_meta": {
      "alt_text": "I bet she gets you to mow the lawn, doesnt she?",
      "number": 61,
      "link": "https://www.xkcd.com/61",
      "image": "staceys_dad.jpg",
      "image_link": "https://imgs.xkcd.com/comics/staceys_dad.jpg"
    }
  },
...,
...,
  {
    "comic": "Valentine - Karnaugh",
    "comic_meta": {
      "alt_text": "Love and circuit analysis, hand in hand at last.",
      "number": 62,
      "link": "https://www.xkcd.com/62",
      "image": "karnaugh.jpg",
      "image_link": "https://imgs.xkcd.com/comics/karnaugh.jpg"
    }
  }
]

Notes/Warnings

Random number generator may produce some integers IDs within range(1, 87) which may not yield any results from xkcd API (404s). In which case, we skip those IDs and store the rest.

Future scopes

** [TODO] try another approach ** - 

Crawl through all the urls from xkcd API first, resolve dependecies endpoint-by-endpoint 
and store into record tables.
Finally, use local database to produce results per ask in the task.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Introduction

Python Setup

Database setup instructions

Usage

Running tests

Development notes

coding style

Task 1

Notes/Warnings

Future scopes

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
commons		commons
models		models
tests		tests
.gitignore		.gitignore
README.md		README.md
database_.sql		database_.sql
requirements.dev.txt		requirements.dev.txt
requirements.txt		requirements.txt
task_one.py		task_one.py

sumepati/xkcdAPI

Folders and files

Latest commit

History

Repository files navigation

Introduction

Python Setup

Database setup instructions

Usage

Running tests

Development notes

coding style

Task 1

Notes/Warnings

Future scopes

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages