DJ Sri Vigneshwar's picture

DJ Sri Vigneshwar

Sri-Vigneshwar-DJ

AI & ML interests

Currently building Hawky.ai - Creative Intelligence for Performance Marketing

Recent Activity

Articles

Organizations

AI FILMS's profile picture GEM benchmark's profile picture MusicAI's profile picture OpenVINO Toolkit's profile picture Open-Source AI Meetup's profile picture East China Normal University's profile picture AI Zero to Hero's profile picture Stable Diffusion Dreambooth Concepts Library's profile picture Blog-explorers's profile picture AI Tamil Nadu's profile picture LocalLLaMA's profile picture MLX Community's profile picture C4AI Community's profile picture M4-ai's profile picture Chinese LLMs on Hugging Face's profile picture Paris AI Running Club's profile picture Hawky.ai - The Creative Analytics Platform's profile picture Hawky.ai - Fine-tuned Language and Creative Generation Models (MarTech) 's profile picture Intelligent Estate's profile picture open/ acc's profile picture Data Is Better Together Contributor's profile picture Arracle AI's profile picture

Posts 3

view post
Post
519
Checkout phi-4 from Microsoft, dropped a day ago... If you ❤️ the Phi series, then here is the GGUF - Sri-Vigneshwar-DJ/phi-4-GGUF. phi-4 is a 14B highly efficient open LLM that beats much larger models at math and reasoning - check out evaluations on the Open LLM.

Technical paper - https://arxiv.org/pdf/2412.08905 ; The Data Synthesis approach is interesting
view post
Post
2008
Just sharing a thought: I started using DeepSeek V3 a lot, and an idea struck me about agents "orchestrating during inference" on a test-time compute model like DeepSeek V3 or the O1 series.

Agents (Instruction + Function Calls + Memory) execute during inference, and based on the output decision, a decision is made to scale the time to reason or perform other tasks.