1 2 33

Leon Knauer

reuank

reuank

AI & ML interests

None yet

Recent Activity

liked a model 9 days ago

DebateLabKIT/Llama-3.1-Argunaut-1-8B-SFT

liked a dataset 9 days ago

DebateLabKIT/deep-argmap-conversations

liked a dataset 9 days ago

DebateLabKIT/deepa2-conversations

View all activity

Organizations

None yet

reuank's activity

liked a model 9 days ago

DebateLabKIT/Llama-3.1-Argunaut-1-8B-SFT

Text Generation • Updated 8 days ago • 105 • 5

liked 2 datasets 9 days ago

DebateLabKIT/deep-argmap-conversations

Viewer • Updated 11 days ago • 604k • 29 • 1

DebateLabKIT/deepa2-conversations

Viewer • Updated 11 days ago • 371k • 19 • 1

liked a model 4 months ago

mattshumer/Reflection-Llama-3.1-70B

Text Generation • Updated Sep 24, 2024 • 621 • 1.71k

reacted to ggbetz's post with 🔥 4 months ago

Post

1190

🧭 Guided Reasoning

👋Hi everyone,

We've been releasing Guided Reasoning:

Our AI guides walk your favorite LLM through complex reasoning problems.

🎯 Goals:

1️⃣ Reliability. AIs consistently follow reasoning methods.
2️⃣ Self-explainability. AIs see reasoning protocols and can explain internal deliberation.
3️⃣ Contestability. Users may amend AI reasoning and revise plausibility assessments.

Try out Guided Reasoning with our light demo chatbot, powered by 🤗 HuggingFace's free Inference Api and small LLMs. (Sorry for poor latency and limited availability -- we are currently searching for 💸 compute sponsors to run more powerful models, faster, and optimize guided reasoning performance.)

Built on top of Logikon's open-source AI reasoning analytics.

Demo chat app: logikon/benjamin-chat
Github: https://github.com/logikon-ai/logikon
Technical report: https://arxiv.org/abs/2408.16331

➡️ Check it out and get involved! Looking forward to hearing from you.

liked a Space 5 months ago

Running

🐨

Open Multilingual Llm Leaderboard

liked 2 models 6 months ago

meta-llama/Llama-3.1-405B-Instruct

Text Generation • Updated Sep 25, 2024 • 28.2k • 559

facebook/multi-token-prediction

Updated Jun 18, 2024 • 354

liked a dataset 7 months ago

tuanh23/SciEx

Viewer • Updated Oct 7, 2024 • 6 • 4 • 1

upvoted a collection 8 months ago

Phi-3

Collection

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated 3 days ago • 545

upvoted an article 9 months ago

Article

Introducing the Open Chain of Thought Leaderboard

Apr 23, 2024

• 28

liked 5 models 9 months ago

liked a dataset 9 months ago

logikon/logikon-bench

Viewer • Updated Sep 30, 2024 • 3.21k • 87 • 6

reacted to ggbetz's post with ❤️ 9 months ago

Post

1441

🥇Open CoT Leaderboard

We're delighted to announce the [Open CoT Leaderboard]( logikon/open_cot_leaderboard) on 🤗 Spaces.

Unlike other LLM performance leaderboards, the Open CoT Leaderboard is not tracking absolute benchmark accuracies, but relative **accuracy gains** due to **chain-of-thought**.

Eval datasets that underpin the leaderboard are hosted [here](https://huggingface.co/cot-leaderboard).

Feedback and suggestions more than welcome.

@clefourrier

5 replies

liked a Space 9 months ago

Running on CPU Upgrade

🥇

Open CoT Leaderboard

Track, rank and evaluate open LLMs' CoT quality

liked a model 10 months ago

xai-org/grok-1

Text Generation • Updated Mar 28, 2024 • 312 • 2.21k