DuckDB Text-2-SQL Bench

Enterprise

community

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

cfahlgren1 updated a dataset about 18 hours ago

duckdb-nsql-hub/duckdb-nsql-scores

cfahlgren1 updated a dataset about 18 hours ago

duckdb-nsql-hub/duckdb-nsql-predictions

cfahlgren1 updated a Space about 1 month ago

duckdb-nsql-hub/DuckDB-NSQL-Leaderboard

View all activity

duckdb-nsql-hub's activity

cfahlgren1

updated 2 datasets about 18 hours ago

duckdb-nsql-hub/duckdb-nsql-scores

Viewer • Updated about 18 hours ago • 113 • 47

duckdb-nsql-hub/duckdb-nsql-predictions

Viewer • Updated about 18 hours ago • 2.4k • 44

cfahlgren1

posted an update 2 days ago

Post

983

Wow, I just added Langfuse tracing to the Deepseek Artifacts app and it's really nice 🔥

It allows me to visualize and track more things along with the cfahlgren1/react-code-instructions dataset.

It was just added as a one click Docker Space template, so it's super easy to self host 💪

cfahlgren1

posted an update 8 days ago

Post

1932

You'll notice the AI in the SQL Console is much better at working with chatml conversations:

Here's example of unnesting the cfahlgren1/react-code-instructions in less than 10 seconds by asking it. Check it out here: cfahlgren1/react-code-instructions

- "show me the average assistant response length"
- "extract user, system, and assistant messages into separate columns"

It's super easy to work with conversational datasets now with natural language 🗣️

cfahlgren1

posted an update 12 days ago

Post

3296

The deepseek-ai/DeepSeek-V3 is very good! I have been playing with it and found it is really good at one-shotting a pretty good landing page.

You can play with it here: https://deepseek-artifacts.vercel.app

All the responses get saved in the cfahlgren1/react-code-instructions dataset. Hopefully we can build one of the biggest, highest quality frontend datasets on the hub 💪

cfahlgren1

updated a Space about 1 month ago

Running

📊

DuckDB NSQL Leaderboard

cfahlgren1

posted an update about 1 month ago

Post

1933

You can just ask things 🗣️

"show me messages in the coding category that are in the top 10% of reward model scores"

Download really high quality instructions from the Llama3.1 405B synthetic dataset 🔥

argilla/magpie-ultra-v1.0

cfahlgren1

updated a dataset about 1 month ago

duckdb-nsql-hub/sql-console-prompt

Viewer • Updated Dec 3, 2024 • 1 • 81 • 9

cfahlgren1

posted an update about 1 month ago

Post

3019

We just dropped an LLM inside the SQL Console 🤯

The amazing, new Qwen/Qwen2.5-Coder-32B-Instruct model can now write SQL for any Hugging Face dataset ✨

It's 2025, you shouldn't be hand writing SQL! This is a big step in making it where anyone can do in depth analysis on a dataset. Let us know what you think 🤗

tdoehmen

updated a Space about 1 month ago

Running

🦆

DuckDB SQL Eval

tdoehmen

in duckdb-nsql-hub/DuckDB-SQL-Eval about 1 month ago

Add new prompt to constants

#2 opened about 1 month ago by

tdoehmen

Add bench prompt

#1 opened about 1 month ago by

tdoehmen

cfahlgren1

posted an update about 2 months ago

Post

919

observers 🔭 - automatically log all OpenAI compatible requests to a dataset💽

• supports any OpenAI compatible endpoint 💪
• supports DuckDB, Hugging Face Datasets, and Argilla as stores

> pip install observers

No complex framework. Just a few lines of code to start sending your traces somewhere. Let us know what you think! @davidberenstein1957 and I will continue iterating!

Here's an example dataset that was logged to Hugging Face from Ollama: cfahlgren1/llama-3.1-awesome-chatgpt-prompts

cfahlgren1

posted an update about 2 months ago

Post

876

You can create charts, leaderboards, and filters on top of any Hugging Face dataset in less than a minute

• ASCII Bar Charts 📊
• Powered by DuckDB WASM ⚡
• Download results to Parquet 💽
• Embed and Share results with friends 📬

Do you have any interesting queries?

cfahlgren1

posted an update about 2 months ago

Post

748

What rank are you on Hugging Face Top Yappers? 🗣️

Find your rank here with this link: cfahlgren1/hub-stats

The Top 3:
- @fdaudens
- @singhsidhukuldeep
- @akhaliq

I am at #71 and need to get my numbers up! 📈

4 replies

cfahlgren1

posted an update about 2 months ago

Post

3126

You can clean and format datasets entirely in the browser with a few lines of SQL.

In this post, I replicate the process @mlabonne used to clean the new microsoft/orca-agentinstruct-1M-v1 dataset.

The cleaning process consists of:
- Joining the separate splits together / add split column
- Converting string messages into list of structs
- Removing empty system prompts

https://huggingface.co/blog/cfahlgren1/the-beginners-guide-to-cleaning-a-dataset

Here's his new cleaned dataset: mlabonne/orca-agentinstruct-1M-v1-cleaned

1 reply

cfahlgren1

posted an update about 2 months ago

Post

2235

Why use Google Drive when you can have:

• Free storage with generous limits🆓
• Dataset Viewer (Sorting, Filtering, FTS) 🔍
• Third Party Library Support
• SQL Console 🟧
• Security 🔒
• Community, Reach, and Visibility 📈

It's a no brainer!

Check out our post on what you get instantly out of the box when you create a dataset.
https://huggingface.co/blog/researcher-dataset-sharing

1 reply

cfahlgren1

posted an update 3 months ago

Post

1163

If you are like me, I like to find up and coming datasets and spaces before everyone else.

I made a trending repo space cfahlgren1/trending-repos where it shows:

- New up and coming Spaces in the last day
- New up and coming Datasets in the last 2 weeks

It's a really good way to find some new gems before they become popular. For example, someone is working on a way to dynamically create assets inside a video game here: gptcall/AI-Game-Creator

cfahlgren1

posted an update 4 months ago

Post

1887

Have you tried the new SQL Console yet?

Would love to know any queries you've tried or general feedback! If you haven't go try it out and let us know 🤗

If you have some interesting queries feel free to share the URLs as well!

1 reply

cfahlgren1

posted an update 4 months ago

Post

1130

Made a fun Space powered by Llama 405B for creating real, working react apps with the awesome plus that you can contribute to an open react dataset by upvoting or downvoting the response 🤗.

https://huggingface.co/spaces/cfahlgren1/llama-artifacts

cfahlgren1/react-code-instructions

1 reply

AI & ML interests

Recent Activity

Team members 2

duckdb-nsql-hub's activity

DuckDB NSQL Leaderboard

DuckDB SQL Eval

Add new prompt to constants

Add bench prompt