Robert Sinclair

ZeroWw

AI & ML interests

LLM optimization (model quantization and back-end optimizations) so that LLMs can run on the computers of people who still have both kidneys. Discord: https://discord.com/channels/@robert_46007

Recent Activity

Organizations

Blog-explorers

ZeroWw's activity

New activity in inflatebot/MN-12B-Mag-Mell-R1 about 1 month ago

Great model! Smaller perhaps?

#10 opened about 1 month ago by
ZeroWw
New activity in ggerganov/whisper.cpp about 1 month ago

Please add the medium-it model

#22 opened about 1 month ago by
ZeroWw
New activity in deepseek-ai/DeepSeek-V2 about 1 month ago
replied to TuringsSolutions's post 3 months ago

hence my idea of the SILLY versions... ;)

replied to TuringsSolutions's post 3 months ago

I am pretty sure that the current models "AS THEY ARE" could perform 10 times better using chain of thought and some algorithms like these, without needing any different training. And I think that's probably what Claude does,

reacted to TuringsSolutions's post with ❤️ 3 months ago
Transformers are not all we need; that is being proven repeatedly now as more alternative frameworks emerge. Another such framework is Kolmogorov-Arnold Network-based Transformers. I break down exactly how these differ from Perceptron-based Transformers and give you the link to my Colab, where I create a model based on the research paper that absolutely destroys a standard Transformer-based model. Check out the video here: https://www.youtube.com/watch?v=Sw0euxNZCc4
reacted to TuringsSolutions's post with ❤️ 3 months ago
I think Reinforcement Learning is the future, for a lot of reasons. I spell them out for you in this video, and also provide you with the basic code to get up and running with Atari and OpenAI Gym. If you want to get into RL, this is your ticket. There is also a link to a cool training montage of the model in the description of the video. Step 2 from here would be the full-on training and certification that Hugging Face offers for RL.

https://youtu.be/ueZl3A36ZQk
New activity in TuringsSolutions/Phi3Unlocked 3 months ago