Vector Database Ingestion Pipeline CLI

An interactive CLI tool for parsing PDFs, generating embeddings with Google Gemini, and upserting the resulting vectors into a Pinecone vector database.

Features

- Interactive CLI with ASCII art banner
- Create Pinecone indexes on the fly
- Parse and upsert PDFs from a folder (no manual path entry)
- Query your vector database with natural language
- Google Gemini embeddings for vectorization
- Modular code (Node.js, ES Modules)
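
To illustrate the "no manual path entry" feature, here is a minimal sketch of how a folder listing could be turned into a menu of PDFs. The helper name `listPdfChoices` and the `{ label, value }` shape are hypothetical and not taken from the project's source; the actual CLI may build its menu differently.

```javascript
// Hypothetical helper: keep only PDF file names from a directory listing
// and turn them into numbered menu entries.
// `entries` is an array of file names, such as the result of reading files/.
function listPdfChoices(entries) {
  return entries
    .filter((name) => name.toLowerCase().endsWith(".pdf")) // case-insensitive match
    .sort((a, b) => a.localeCompare(b))
    .map((name, i) => ({ label: `${i + 1}. ${name}`, value: name }));
}

// Example with a hard-coded listing instead of a real directory read.
const choices = listPdfChoices(["notes.txt", "report.PDF", "atlas.pdf"]);
console.log(choices.map((c) => c.label).join("\n"));
```

In a real run, the input would come from something like `fs.promises.readdir("files")`, and the selected `value` would be joined with the folder path before parsing.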

Quick Start

Clone the repo
```
git clone https://github.com/Lilsax/pinecone-data-pipeline.git
cd pinecone-data-pipeline
```

Install dependencies
```
npm install
```
Set up your environment variables
- Create a .env file in the root directory:
```
PINECONE_API_KEY=your-pinecone-key
GOOGLE_API_KEY=your-google-api-key
```
Add your PDFs to the files/ directory.
Run the CLI
```
node index.js
```
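
The two keys from the `.env` step can be checked before the CLI does any work. The following is a minimal sketch, assuming the variables are already present in `process.env`. How the project loads `.env` is not stated here; a loader package or Node's `--env-file=.env` flag are two possibilities. The helper name `readConfig` is hypothetical.

```javascript
// Hypothetical helper: fail fast if a required key is missing or blank.
// `env` is a plain object of environment variables, normally process.env.
function readConfig(env) {
  const required = ["PINECONE_API_KEY", "GOOGLE_API_KEY"];
  const missing = required.filter((key) => !env[key] || env[key].trim() === "");
  if (missing.length > 0) {
    throw new Error(`Missing environment variables: ${missing.join(", ")}`);
  }
  return {
    pineconeApiKey: env.PINECONE_API_KEY,
    googleApiKey: env.GOOGLE_API_KEY,
  };
}

// Report a readable message instead of crashing later on a failed API call.
try {
  readConfig(process.env);
  console.log("Configuration loaded.");
} catch (err) {
  console.error(err.message);
}
```

Passing the environment in as an argument keeps the check easy to test with a plain object.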

Usage

- Create Index: Create a new Pinecone index interactively.
- Parse/Upsert PDF: Select a PDF from the files/ folder and push its embeddings to Pinecone.
- Query: Enter a natural language query to search your vector database.
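
The Parse/Upsert step can be pictured as: split the extracted text into chunks, embed each chunk, and package the results as Pinecone records of the form `{ id, values, metadata }`. The sketch below covers only the chunking and packaging. The chunking strategy (overlapping character windows), the ID scheme, the batch size, and the helper names `chunkText`, `toRecords`, and `batch` are all assumptions for illustration, not the project's confirmed behavior. Placeholder vectors stand in for Gemini embeddings.

```javascript
// Split text into overlapping character windows (assumed strategy).
function chunkText(text, size = 1000, overlap = 200) {
  if (size <= 0 || overlap < 0 || overlap >= size) {
    throw new RangeError("Require size > 0 and 0 <= overlap < size");
  }
  const chunks = [];
  for (let start = 0; start < text.length; start += size - overlap) {
    chunks.push(text.slice(start, start + size));
    if (start + size >= text.length) break; // last window reached the end
  }
  return chunks;
}

// Pair each chunk with its embedding in the { id, values, metadata } record shape.
function toRecords(source, chunks, vectors) {
  if (chunks.length !== vectors.length) {
    throw new Error("chunks and vectors must have the same length");
  }
  return chunks.map((text, i) => ({
    id: `${source}#${i}`, // hypothetical ID scheme: file name plus chunk index
    values: vectors[i],
    metadata: { source, chunk: i, text },
  }));
}

// Group records into batches so each upsert request stays small.
function batch(items, size = 100) {
  const out = [];
  for (let i = 0; i < items.length; i += size) out.push(items.slice(i, i + size));
  return out;
}

// Demo with placeholder vectors instead of real embeddings.
const demoChunks = chunkText("abcdefghij", 4, 1);
const demoVectors = demoChunks.map(() => [0.1, 0.2, 0.3]);
const demoRecords = toRecords("demo.pdf", demoChunks, demoVectors);
console.log(batch(demoRecords, 2).map((group) => group.map((r) => r.id)));
```

In a real pipeline, the placeholder vectors would be replaced by embeddings from the Gemini API, and each batch would be handed to the Pinecone client's upsert call for the target index.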

Why This Project?

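- Shows an end-to-end AI data engineering workflow: parse, embed, upsert, query
- Demonstrates Node.js with ES Modules in a modular layout
- Suitable as a portfolio project or as a starting point for a production pipeline
- Easy to extend for your own use cases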

