
DJ Sri Vigneshwar

Sri-Vigneshwar-DJ

https://hawky.ai/

AI & ML interests

Currently building Hawky.ai - Creative Intelligence for Performance Marketing

Recent Activity

posted an update 1 day ago

Check out phi-4 from Microsoft, released a day ago. If you ❤️ the Phi series, here is the GGUF: https://huggingface.co/Sri-Vigneshwar-DJ/phi-4-GGUF. phi-4 is a highly efficient 14B open LLM that beats much larger models at math and reasoning; see the evaluations on the Open LLM Leaderboard. Technical paper: https://arxiv.org/pdf/2412.08905. The data synthesis approach is interesting.

updated a model 1 day ago

Sri-Vigneshwar-DJ/phi-4-GGUF

liked a dataset 1 day ago

cfahlgren1/react-code-instructions


Articles

Building Effective Agents with Anthropic’s Best Practices and smolagents ❤️

7 days ago



Posts 3

Post

519

Check out phi-4 from Microsoft, released a day ago. If you ❤️ the Phi series, here is the GGUF: Sri-Vigneshwar-DJ/phi-4-GGUF. phi-4 is a highly efficient 14B open LLM that beats much larger models at math and reasoning; see the evaluations on the Open LLM Leaderboard.

Technical paper: https://arxiv.org/pdf/2412.08905. The data synthesis approach is interesting.

Post

2008

Just sharing a thought: I started using DeepSeek V3 a lot, and an idea struck me about agents "orchestrating during inference" on a test-time compute model like DeepSeek V3 or the o1 series.

Agents (instruction + function calls + memory) execute during inference, and based on each intermediate output, the agent decides whether to scale up the time spent reasoning or to perform other tasks.
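
To make the idea concrete, here is a minimal Python sketch of such a loop. Everything in it is an illustrative assumption, not an existing API: `Agent`, `stub_model`, `run`, the `add` tool, and the doubling "reasoning budget" are hypothetical names, and `stub_model` is a hard-coded stand-in for a real model call.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional


@dataclass
class Agent:
    """Hypothetical agent: instruction + function calls (tools) + memory."""
    instruction: str
    tools: Dict[str, Callable[[str], str]]
    memory: List[str] = field(default_factory=list)


def stub_model(prompt: str, budget: int) -> dict:
    """Stand-in for a model call (assumption). Returns a decision dict."""
    if budget < 2:
        # Not enough reasoning budget yet: ask the loop to scale it up.
        return {"action": "think_more"}
    if "tool:" not in prompt:
        # No tool result in memory yet: request a function call.
        return {"action": "call_tool", "tool": "add", "input": "2 3"}
    # A tool result is available: answer with the latest one.
    return {"action": "answer", "text": prompt.rsplit("tool:", 1)[1].strip()}


def run(agent: Agent, task: str, model=stub_model, max_steps: int = 8) -> Optional[str]:
    """Orchestrate during inference: each output decides the next step."""
    budget = 1
    for _ in range(max_steps):
        prompt = "\n".join([agent.instruction, task, *agent.memory])
        decision = model(prompt, budget)
        if decision["action"] == "think_more":
            budget *= 2  # scale the time allowed for reasoning
            agent.memory.append(f"note: raised reasoning budget to {budget}")
        elif decision["action"] == "call_tool":
            result = agent.tools[decision["tool"]](decision["input"])
            agent.memory.append(f"tool: {result}")
        else:
            return decision["text"]
    return None  # gave up after max_steps


def add(args: str) -> str:
    """Toy tool: sum whitespace-separated integers."""
    return str(sum(int(x) for x in args.split()))
```

With a real model, `stub_model` would be replaced by a call that returns the same kind of decision, and the budget could map to whatever reasoning-effort control that model exposes.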
