
Beckett Dillon (Severian)

AI & ML interests: I make music, teach machines, study nature, and build things.


Organizations: ZeroGPU Explorers, The Hydra Project, LocalLLaMA, Anima, MLX Community, Vodalus, Social Post Explorers, Underground Digital

Posted an update 3 days ago
Interesting Solution to the Problem of Misguided Attention

I've been fascinated by the problem of Misguided Attention for a few weeks. I'm trying to build an inference algorithm to help LLMs address that issue, and in the process I found a cool short-term fix I call "Mindful Attention," using just prompt engineering.

Have you ever thought about how our brains filter reality through layers of past experiences, concepts, and mental images? For example, when you look at an oak tree, are you truly seeing that oak tree in all its unique details, or are you overlaying it with a generalized idea of "oak tree"? This phenomenon inspired the new approach.

LLMs often fall into a similar trap, hence the Misguided Attention problem. They process input not as it's uniquely presented but through patterns and templates they've seen before. This leads to responses that can feel "off," like missing the point of a carefully crafted prompt or defaulting to familiar but irrelevant solutions (for example, answering a slightly modified version of a classic riddle as if it were the unmodified original).

I wanted to address this head-on by encouraging LLMs to slow down, focus, and engage directly with the input—free of assumptions. This is the core of the Mindful Attention Directive, a prompt designed to steer models away from over-generalization and back into the moment.
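To make the mechanics concrete, here is a minimal sketch of how a directive like this gets wired in: it is simply prepended to the system message so it frames every request. The directive text below is a short stand-in I wrote for illustration — the real, full prompt is the one in the Gist linked at the end of this post.

```python
# Sketch: prepend a mindful-attention-style directive to the system prompt.
# MINDFUL_DIRECTIVE is a placeholder; the actual (much longer) prompt is in the Gist.
MINDFUL_DIRECTIVE = (
    "Before answering, set aside familiar patterns. Read the input exactly "
    "as written, note how it differs from similar-looking problems you may "
    "recall, and respond only to what is actually asked."
)

def build_mindful_messages(user_input: str, base_system: str = "") -> list[dict]:
    """Return an OpenAI-style chat message list with the directive prepended
    to whatever system prompt was already in use."""
    system = f"{MINDFUL_DIRECTIVE}\n\n{base_system}".strip()
    return [
        {"role": "system", "content": system},
        {"role": "user", "content": user_input},
    ]

# Usage: pass the result to whatever chat-completion client you normally use.
messages = build_mindful_messages(
    "In this version of the puzzle, the boat is big enough for everyone at once..."
)
```

Because the directive rides along in the system role, it applies to every turn without touching the user's actual input.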

You can read more about the broader issue here: https://github.com/cpldcpu/MisguidedAttention

And if you want to try this mindful approach in action, check out the LLM I’ve set up for testing: https://hf.co/chat/assistant/677e7ebcb0f26b87340f032e. It works about 80% of the time to counteract these issues, and the results are pretty cool.
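If you want to sanity-check a number like that yourself, a tiny harness along these lines works: run a set of trick prompts and count how many responses contain the expected key phrase. Everything here is an assumption for illustration — `query_model` is a stub you'd replace with a real inference call, and the pass criteria are simple substring checks you'd define per prompt.

```python
# Sketch of an informal evaluation harness for a directive's success rate.
def query_model(prompt: str, system: str = "") -> str:
    """Stub: swap in a real inference call (API client, local model, etc.)."""
    return "stubbed response"

def pass_rate(cases: list[tuple[str, str]], system: str = "") -> float:
    """cases: (trick_prompt, expected_substring) pairs.
    Returns the fraction of responses containing the expected phrase."""
    passed = sum(
        expected.lower() in query_model(prompt, system).lower()
        for prompt, expected in cases
    )
    return passed / len(cases)

# Example case list: modified classic puzzles, each with a key phrase the
# model should produce if it read the modification carefully.
cases = [
    ("In this Monty Hall variant the host opens no door. Should you switch?",
     "no advantage"),
]

# Compare pass_rate(cases) with and without the directive in `system`.
```

Running the same cases with and without the directive in the system prompt gives a rough before/after comparison.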

I'll add the Gist with the full prompt. I admit it's quite verbose, but it's the most effective version I've landed on yet. I'm working on a smaller version that can be appended to any system prompt to harness the Mindful Attention effect. Feel free to experiment and find a better version for the community!

Here is the Gist: https://gist.github.com/severian42/6dd96a94e546a38642278aeb4537cfb3