🐬 Phin: The CSUCI Companion

A Retrieval-Augmented Generation (RAG) Assistant for CSU Channel Islands

Phin is an AI-powered academic assistant designed to help students at CSUCI navigate their academic journey. Built on the OpenAI Assistants API (GPT-4 Turbo), Phin replaces static keyword searches with natural conversation, offering tailored course recommendations, natural language scheduling, and real-time answers to campus queries.

📸 Demo

Left: Natural Language Course Recs | Right: RAG-based Event Retrieval

🚀 Key Features

🎓 Intelligent Course Recommendations: Suggests classes based on major, interests, and prerequisites.
🗓️ Natural Language Scheduling: "Agentic" scheduling that optimizes class times based on personal constraints (e.g., "Keep my Fridays free").
🏫 Real-Time Campus Knowledge: RAG pipeline integrated with scraping to answer questions about clubs, events, and deadlines.
🔗 Deep Integration: Custom-built data ingestion pipeline for up-to-date course catalogs and event data.

🛠️ Architecture

flowchart TB
  U([User]) -->|Query| T[Create/Retrieve Thread]
  T --> P{{Phin}}

  %% CSUCI-specific retrieval
  P -->|CSUCI Specific| KR[Query CSUCI Data Sources]
  KR --> IDX[Retrieve from CSUCI Knowledge Base]
  IDX --> LLM

  %% General
  P -->|General Query| LLM[Use GPT-4 Turbo Model]

  %% Analytical / code
  P -->|Analytical| CI[Code Interpreter]
  CI --> SETUP[Setup Environment & Execute Code]
  SETUP --> LLM

  %% Output
  LLM --> GEN[Generate / Format Response]
  GEN --> D([Display to User])

Tech Stack: Flask · LangChain · OpenAI Assistants API · Scrapy · Twisted

🕷️ Data Engineering

Unlike generic chatbots, Phin relies on ground-truth data directly from the university. We engineered a custom ETL pipeline to ensure accuracy:

Course Catalog Scraper

Tooling: Utilized Scrapy for its asynchronous event-driven architecture (Twisted reactor).
Logic: The crawler systematically traverses the CSUCI course catalog hierarchy (Subjects → Course Details), parsing HTML structures to extract prerequisites, units, and descriptions.
Output: Exports standardized JSON feeds (see data/samples/sample_output.json) which are then indexed for retrieval by the RAG system.

Check the data-ingestion/course-scraper directory for the crawler implementation.

📂 Repository Structure

CSUCI_Companion/
├── app.py                      # Flask application entry point
├── requirements.txt            # Dependencies
├── data-ingestion/
│   └── course-scraper/         # Custom Scrapy crawler (Async/Twisted)
├── data/
│   └── samples/                # Sample JSON outputs from scraper
├── static/                     # CSS/JS assets
├── images/                     # Documentation images
└── README.md

⚡ Quickstart

Prerequisites: Python 3.10+, OpenAI API Key, and Assistant ID.

# 1) Create & activate a virtual env
python -m venv .venv
source .venv/bin/activate

# 2) Install libraries
pip install -r requirements.txt

# 3) Add your keys (secret.py is gitignored)
cat > secret.py <<'PY'
api_key = "sk-..."
assistant_id = "asst_..."
secret_key = "change-me"
PY

# 4) Run the app
flask --app app run --port 8000
# visit http://localhost:8000

Environment Variables (secret.py)

Variable	Required	Description
`api_key`	✅	OpenAI API Key
`assistant_id`	✅	OpenAI Assistant ID to run
`secret_key`	⚙️	Flask session security string

ℹ️ Project Origin

Developed as a Computer Science Capstone at CSU Channel Islands.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🐬 Phin: The CSUCI Companion

📸 Demo

🚀 Key Features

🛠️ Architecture

🕷️ Data Engineering

Course Catalog Scraper

📂 Repository Structure

⚡ Quickstart

Environment Variables (secret.py)

ℹ️ Project Origin

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 79 Commits
data-ingestion/course-scraper		data-ingestion/course-scraper
data/samples		data/samples
images		images
static		static
.gitignore		.gitignore
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

OronaDaniel/CSUCI_Companion

Folders and files

Latest commit

History

Repository files navigation

🐬 Phin: The CSUCI Companion

📸 Demo

🚀 Key Features

🛠️ Architecture

🕷️ Data Engineering

Course Catalog Scraper

📂 Repository Structure

⚡ Quickstart

Environment Variables (secret.py)

ℹ️ Project Origin

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages