Paraskevi Kivroglou

KvrParaskevi

Paraskevi-KIvroglou

AI & ML interests

I am looking forward into a world full of AI innovation. By having small ideas in new projects, I want to take the next step and give them life.

Recent Activity

liked a dataset 1 day ago

semeru/code-text-python

liked a dataset 4 days ago

CodeEval-Pro/mbpp-pro

liked a dataset 5 days ago

m-ric/huggingface_doc

View all activity

Organizations

KvrParaskevi's activity

liked a dataset 1 day ago

semeru/code-text-python

Viewer • Updated Mar 23, 2023 • 281k • 199 • 7

liked a dataset 4 days ago

CodeEval-Pro/mbpp-pro

Viewer • Updated 11 days ago • 378 • 17 • 2

liked 2 datasets 5 days ago

m-ric/huggingface_doc

Viewer • Updated Jan 9, 2024 • 2.65k • 1.87k • 11

m-ric/agents_medium_benchmark_2

Viewer • Updated 15 days ago • 142 • 142 • 7

liked a dataset 7 days ago

code-search-net/code_search_net

Updated Jan 18, 2024 • 3.51k • 278

liked a model 8 days ago

jinaai/jina-embeddings-v2-base-code

Feature Extraction • Updated 5 days ago • 61.5k • 77

liked a dataset 21 days ago

evalplus/mbppplus

Viewer • Updated Apr 17, 2024 • 378 • 28.8k • 8

liked a dataset 27 days ago

BAAI/TACO

Updated Jun 19, 2024 • 1.18k • 77

upvoted a paper about 2 months ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18, 2024 • 140

liked a model about 2 months ago

yiyanghkust/finbert-tone

Text Classification • Updated Oct 17, 2022 • 357k • 162

liked a dataset about 2 months ago

ibm/finqa

Updated Jun 6, 2024 • 843 • 3

liked a dataset 2 months ago

rajpurkar/squad_v2

Viewer • Updated Mar 4, 2024 • 142k • 17.6k • 189

liked a model 2 months ago

foduucom/stockmarket-pattern-detection-yolov8

Object Detection • Updated Sep 11, 2023 • 24.6k • 229

reacted to reach-vb's post with 🚀 2 months ago

Post

2995

Smol models ftw! AMD released AMD OLMo 1B - beats OpenELM, tiny llama on MT Bench, Alpaca Eval - Apache 2.0 licensed 🔥

> Trained with 1.3 trillion (dolma 1.7) tokens on 16 nodes, each with 4 MI250 GPUs

> Three checkpoints:

- AMD OLMo 1B: Pre-trained model
- AMD OLMo 1B SFT: Supervised fine-tuned on Tulu V2, OpenHermes-2.5, WebInstructSub, and Code-Feedback datasets
- AMD OLMo 1B SFT DPO: Aligned with human preferences using Direct Preference Optimization (DPO) on UltraFeedback dataset

Key Insights:
> Pre-trained with less than half the tokens of OLMo-1B
> Post-training steps include two-phase SFT and DPO alignment
> Data for SFT:
- Phase 1: Tulu V2
- Phase 2: OpenHermes-2.5, WebInstructSub, and Code-Feedback

> Model checkpoints on the Hub & Integrated with Transformers ⚡️

Congratulations & kudos to AMD on a brilliant smol model release! 🤗

amd/amd-olmo-6723e7d04a49116d8ec95070

replied to qq8933's post 2 months ago

Awesome work. Can we finetune further this reasoning model?

reacted to qq8933's post with 👍 2 months ago

Post

6389

LLaMA-O1: Open Large Reasoning Model Frameworks For Training, Inference and Evaluation With PyTorch and HuggingFace
Large Reasoning Models powered by Monte Carlo Tree Search (MCTS), Self-Play Reinforcement Learning, PPO, AlphaGo Zero's dua policy paradigm and Large Language Models!
https://github.com/SimpleBerry/LLaMA-O1/

What will happen when you compound MCTS ❤ LLM ❤ Self-Play ❤RLHF?
Just a little bite of strawberry!🍓

Past related works:
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning (2410.02884)
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B (2406.07394)

2 replies

upvoted a paper 2 months ago

Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B

Paper • 2406.07394 • Published Jun 11, 2024 • 26

liked a model 2 months ago

nroggendorff/smallama

Text Generation • Updated 22 days ago • 502 • 6

reacted to nroggendorff's post with 👀 2 months ago

Post

2653

When huggingface patches this, I'm going to be really sad, but in the meantime, here you go:

When AutoTrain creates a new space to train your model, it does so via the huggingface API. If you modify the code so that it includes a premade README.md file, you can add these two lines:

---
app_port: 8080 # or any integer besides 7860 that's greater than 2 ** 10
startup_duration_timeout: 350m
---

This will tell huggingface to listen for the iframe on your port, instead of the one autotrain is actually hosting on, and because startup time isn't charged, you get the product for free. (you can take this even further by switching compute type to A100 or something)

1 reply

liked a dataset 2 months ago

ajibawa-2023/Software-Architecture

Preview • Updated Oct 28, 2024 • 42 • 19