Status: publishable MVP with CLI, JSON API wrapper, and static web demo.
This utility is the first build candidate from the Utility-to-Usage scanner.
Clean messy CSV files by:
- Normalizing headers into stable API-friendly names.
- Trimming cells.
- Dropping empty rows.
- Padding short rows.
- Preserving overflow cells by adding extra_n columns.
- Removing duplicate rows.
- Returning a cleaned CSV plus a machine-readable cleanup report.
Distribution paths:
- RapidAPI micro-API with free tier and paid bulk rows.
- Free web tool with paid export or batch mode.
- Local CLI as proof of capability for marketplace listing screenshots and examples.
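The cleanup steps listed in this README (header normalization, trimming, dropping empty rows, padding, overflow columns, de-duplication, and a report) can be sketched roughly as follows. This is a minimal illustration using only the standard library, not the code in csv_cleaner.py: the function names, the report keys, the `extra_1`, `extra_2` naming, and the `column_N` fallback for blank headers are all assumptions made for the example.

```python
import csv
import io
import re


def normalize_header(name, index):
    """Lowercase a header and collapse non-alphanumeric runs into "_".

    Hypothetical rule: blank headers fall back to a positional name.
    """
    slug = re.sub(r"[^0-9a-z]+", "_", name.strip().lower()).strip("_")
    return slug or f"column_{index + 1}"


def clean_csv(csv_text, key_columns=None):
    """Return (cleaned_csv_text, report). Illustrative sketch only."""
    report = {
        "empty_rows_dropped": 0,
        "rows_padded": 0,
        "overflow_columns_added": 0,
        "duplicates_removed": 0,
    }
    rows = list(csv.reader(io.StringIO(csv_text)))
    if not rows:
        return "", report

    headers = [normalize_header(h, i) for i, h in enumerate(rows[0])]
    original_width = len(headers)

    body = []
    for raw in rows[1:]:
        cells = [cell.strip() for cell in raw]  # trim every cell
        if not any(cells):  # drop rows with no content at all
            report["empty_rows_dropped"] += 1
            continue
        if len(cells) < original_width:
            report["rows_padded"] += 1
        # Keep overflow cells by growing the header with extra_N names.
        while len(cells) > len(headers):
            report["overflow_columns_added"] += 1
            headers.append(f"extra_{report['overflow_columns_added']}")
        body.append(cells)

    # Pad every row to the final width so the output is rectangular.
    body = [cells + [""] * (len(headers) - len(cells)) for cells in body]

    # De-duplicate on the key columns (all columns when none are given),
    # keeping the first occurrence. An unknown key raises ValueError.
    key_idx = [headers.index(k) for k in key_columns] if key_columns else None
    seen, unique = set(), []
    for cells in body:
        key = tuple(cells[i] for i in key_idx) if key_idx else tuple(cells)
        if key in seen:
            report["duplicates_removed"] += 1
            continue
        seen.add(key)
        unique.append(cells)

    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(headers)
    writer.writerows(unique)
    return out.getvalue(), report
```

In this sketch, trimming happens before de-duplication, so rows that differ only in surrounding whitespace are treated as duplicates.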
This MVP uses only user-provided or sample CSV files. Public publishing is allowed under the standing Utility-to-Usage permission as long as it does not require spending money, payout setup, legal/tax/payment details, OTP/MFA, CAPTCHA, account-security changes, or contractual commitments.
Clean the sample CSV from the command line:
C:\Users\lauku\.cache\codex-runtimes\codex-primary-runtime\dependencies\python\python.exe .\csv_cleaner.py .\samples\messy.csv --out .\samples\cleaned.csv --report .\samples\report.json
Start the local API server:
C:\Users\lauku\.cache\codex-runtimes\codex-primary-runtime\dependencies\python\python.exe .\api_app.py --host 127.0.0.1 --port 8000
Call the JSON endpoint:
$body = @{
csv_text = "Full Name,Email`n Alice , alice@example.com`nAlice,alice@example.com`n"
key_columns = @("full_name", "email")
} | ConvertTo-Json
Invoke-RestMethod -Method Post -Uri http://127.0.0.1:8000/clean-csv -ContentType application/json -Body $body
Live demo:
https://csv-dedupe-normalize-api.vercel.app
The static demo in index.html calls the Vercel-style endpoint at /api/clean_csv.
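For callers who prefer Python over PowerShell, the same request can be sketched with the standard library. The helper names `build_request` and `clean_csv_remote` are hypothetical, the default URL is the local server address used in this README, and the code assumes only that the endpoint answers with a JSON document; the response fields are not documented here, so the sketch returns the parsed body as-is.

```python
import json
import urllib.request

# Local server address used in this README. For the hosted demo, pass the
# deployment's /api/clean_csv path as `url` instead.
DEFAULT_URL = "http://127.0.0.1:8000/clean-csv"


def build_request(csv_text, key_columns, url=DEFAULT_URL):
    """Build the POST request with the csv_text / key_columns JSON body."""
    payload = json.dumps(
        {"csv_text": csv_text, "key_columns": key_columns}
    ).encode("utf-8")
    return urllib.request.Request(
        url,
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def clean_csv_remote(csv_text, key_columns, url=DEFAULT_URL, timeout=30):
    """Send the request and return the decoded JSON response.

    Assumption: the server replies with JSON; its exact shape is not
    specified in this README.
    """
    request = build_request(csv_text, key_columns, url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (requires the local API server to be running):
# result = clean_csv_remote(
#     "Full Name,Email\n Alice , alice@example.com\nAlice,alice@example.com\n",
#     ["full_name", "email"],
# )
# print(result)
```

Separating request construction from sending keeps the payload easy to inspect without any network access.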
Serverless endpoint:
POST /api/clean_csv
Content-Type: application/json
Request body:
{
"csv_text": "Full Name,Email\n Alice , alice@example.com\nAlice,alice@example.com\n",
"key_columns": ["full_name", "email"]
}
Run the tests from this folder:
C:\Users\lauku\.cache\codex-runtimes\codex-primary-runtime\dependencies\python\python.exe -m unittest discover -s .\tests
- GitHub: public open-source repository.
- Vercel: deployed at https://csv-dedupe-normalize-api.vercel.app.
- RapidAPI: marketplace listing after endpoint host, pricing, and account path are confirmed without hard-stop prompts.