π’•π’‚π’π’—π’Šπ’“'s picture

π’•π’‚π’π’—π’Šπ’“

Tanvir1337

AI & ML interests

Deep Learning, Generative Adversarial Networks, Transformer, Diffusion, SOTA Foundation Models

Recent Activity

liked a model about 4 hours ago
NovaSky-AI/Sky-T1-32B-Preview
updated a collection about 20 hours ago
Spaces
liked a Space about 20 hours ago
Pendrokar/TTS-Spaces-Arena
View all activity

Organizations

Stanford AI's profile picture AI FILMS's profile picture Samsung Electronics's profile picture MISATO-dataset's profile picture Masakhane NLP's profile picture GEM benchmark's profile picture LangChain Agents Hub's profile picture LangChain Chains Hub's profile picture OpenGVLab's profile picture MusicAI's profile picture BigScience Biomedical Datasets's profile picture LangChainDatasets's profile picture fast.ai community's profile picture OpenVINO Toolkit's profile picture LLMs's profile picture ONNXConfig for all's profile picture Gradio-Blocks-Party's profile picture scikit-learn's profile picture AMD's profile picture lora concepts library's profile picture DeepGHS's profile picture Open-Source AI Meetup's profile picture Arabic Machine Learning 's profile picture DataScienceGuild's profile picture Literally Me FRFR Research Society's profile picture East China Normal University's profile picture Pseudo Lab's profile picture Tune a video concepts library's profile picture LangChain Hub Prompts's profile picture Keras Dreambooth Event's profile picture Stable Diffusion Dreambooth Concepts Library's profile picture The Waifu Research Department's profile picture AI Indonesia Community's profile picture Blog-explorers's profile picture OpenSky's profile picture CyberHarem's profile picture ICCV2023's profile picture Tensor Diffusion's profile picture ICML2023's profile picture OpenLLM France's profile picture huggingPartyParis's profile picture MultiπŸ€–Transformers's profile picture Team Tonic's profile picture That Time I got Reincarnated as a Hugging Face Organization's profile picture ZeroGPU Explorers's profile picture SaprotHub's profile picture Project Fluently's profile picture LocalLLaMA's profile picture Bangla Large Language Model's profile picture MLX Community's profile picture Argilla Explorers's profile picture INNOVA AI's profile picture Narra's profile picture C4AI Community's profile picture M4-ai's profile picture takara.ai's profile picture Refine AI's profile picture Chinese LLMs on Hugging Face's profile picture Paris AI Running Club's profile picture Stable Diffusion Community (Unofficial, Non-profit)'s profile picture Hugging Face for Legal's profile picture Hugging Face Discord Community's profile picture mekasiu's profile picture

Tanvir1337's activity

reacted to Severian's post with ❀️ 2 days ago
view post
Post
3663
Interesting Solution to the Problem of Misguided Attention

So I've been fascinated by the problem of Misguided Attention for a few weeks. I am trying to build an inference algorithm to help LLMs address that issue; but in the process, I found a cool short-term fix I call "Mindful Attention" using just prompt-engineering.

Have you ever thought about how our brains filter reality through layers of past experiences, concepts, and mental images? For example, when you look at an oak tree, are you truly seeing that oak tree in all its unique details, or are you overlaying it with a generalized idea of "oak tree"? This phenomenon inspired the new approach.

LLMs often fall into a similar trap, hence the Misguided Attention problem. They process input not as it’s uniquely presented but through patterns and templates they’ve seen before. This leads to responses that can feel "off," like missing the point of a carefully crafted prompt or defaulting to familiar but irrelevant solutions.

I wanted to address this head-on by encouraging LLMs to slow down, focus, and engage directly with the inputβ€”free of assumptions. This is the core of the Mindful Attention Directive, a prompt designed to steer models away from over-generalization and back into the moment.

You can read more about the broader issue here: https://github.com/cpldcpu/MisguidedAttention

And if you want to try this mindful approach in action, check out the LLM I’ve set up for testing: https://hf.co/chat/assistant/677e7ebcb0f26b87340f032e. It works about 80% of the time to counteract these issues, and the results are pretty cool.

I'll add the Gist with the full prompt. I admit, it is quite verbose but it's the most effective one I have landed on yet. I am working on a smaller version that can be appended to any System Prompt to harness the Mindful Attention. Feel free to experiment to find a better version for the community!

Here is the Gist: https://gist.github.com/severian42/6dd96a94e546a38642278aeb4537cfb3