ok i'm back. so, like, not a single word of this is meaningful in any way, and the few concrete claims it does make are riddled with factual errors.
first off, evolutionary merging as a concept isn't new; mergekit (via mergekit-evolve) can already do this.

There is no way you merged two models of different architectures and got a positive result. Even if the "mother" were only ever trained on text, it would definitionally still have to be multimodal architecturally; otherwise it isn't Qwen3.5 at all. And there is no other architecture that is even remotely compatible with Qwen3.5's; the DeltaNet attention heads see to that. If what you're saying is true, then you're splicing a cat's brain into a dog's (or, to be somewhat charitable, a cat's brain into an ocelot's). This alone is all I needed, but there's more, actually.
201 Languages | Potentially degraded | Inherited from Father
You presume. Any amount of finetuning is going to result in specialization toward the target domain. 201 languages are necessarily represented in a distributed way across the whole model, in a way that layer-wise merging can't preserve without other techniques that are already implemented and mathematically grounded (tl;dr: task arithmetic works; sketch below).
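For anyone unfamiliar, here's roughly what I mean by task arithmetic, as a minimal sketch of the standard delta-vector trick; it has nothing to do with Darwin's actual pipeline, and the checkpoint contents and scaling factor are toy placeholders:

```python
# Minimal task-arithmetic sketch (illustrative; checkpoints and lambda are placeholders).
# delta = finetune - base captures "what the finetune learned"; adding a scaled delta back
# onto a base with the same architecture preserves distributed capabilities far better
# than naively swapping whole layers.
import torch

def task_vector(base_sd: dict, tuned_sd: dict) -> dict:
    """Per-parameter difference between a finetune and its base."""
    return {k: tuned_sd[k] - base_sd[k] for k in base_sd}

def apply_task_vector(base_sd: dict, delta: dict, lam: float = 0.7) -> dict:
    """Add the scaled task vector back onto a base checkpoint."""
    return {k: base_sd[k] + lam * delta[k] for k in base_sd}

if __name__ == "__main__":
    # Toy tensors stand in for real state_dicts (load yours with torch.load / safetensors).
    base = {"layer.weight": torch.zeros(4, 4)}
    tuned = {"layer.weight": torch.ones(4, 4)}
    merged = apply_task_vector(base, task_vector(base, tuned), lam=0.5)
    print(merged["layer.weight"][0, 0].item())  # 0.5
```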
Benchmark Transparency | No scores published | Fully open
Your "mother" is a community finetune. This is vexatious language that degrades a hobbyist project as being "opaque." Not that a human wrote this model card and I don't even need to ask Pangram about that.
Model MRI Integration – CT-scans parent models layer by layer before merging, guiding evolution with structural insight
If conventional merging is "mixing recipes blindfolded," Darwin V5 is "precision surgery with X-ray guidance."
Extremely waffly use of medical terminology, with no technical definition whatsoever in this context: none exists and none is provided.
Traditional model merging relies on humans setting hyperparameters like ratio and density by intuition. Set ratio=0.5, density=0.9, run once, and hope for the best. The result depends on luck, and applying the same ratio uniformly across billions of parameters ignores each layer's unique role.
Laughably wrong. The entire point of merging is that iteration is fast. "By intuition" is not a meaningful critique, because intuition is the only way humans do this sort of thing. And "applying the same ratio uniformly across billions of parameters ignores each layer's unique role" presumes A) that layers have a "unique role" (if it were that simple, mechanistic interpretability would be solved), and B) that we have to use the same ratios at all - essentially "model merging hasn't evolved since 2023," which I can disprove by just looking at any current merge config. Per-layer ratios are a few lines of code; see the sketch below.
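Here's how little code a non-uniform, per-layer ratio actually takes; the layer count and ratio schedule below are made up purely to show the mechanism, and mergekit-style configs express the same thing declaratively:

```python
# Toy per-layer interpolation: each transformer block gets its own mixing ratio.
# The ratio schedule here is invented for illustration; real recipes set or search these.
import re
import torch

NUM_LAYERS = 4  # hypothetical; a real model would have 40+

def layer_index(param_name: str) -> int | None:
    """Extract the block index from names like 'model.layers.12.mlp.down_proj.weight'."""
    m = re.search(r"layers\.(\d+)\.", param_name)
    return int(m.group(1)) if m else None

def merge_per_layer(sd_a: dict, sd_b: dict, ratios: list[float]) -> dict:
    merged = {}
    for name, a in sd_a.items():
        b = sd_b[name]
        idx = layer_index(name)
        t = ratios[idx] if idx is not None else 0.5  # embeddings/head get a flat default here
        merged[name] = (1 - t) * a + t * b
    return merged

if __name__ == "__main__":
    sd_a = {f"model.layers.{i}.mlp.down_proj.weight": torch.zeros(2, 2) for i in range(NUM_LAYERS)}
    sd_b = {f"model.layers.{i}.mlp.down_proj.weight": torch.ones(2, 2) for i in range(NUM_LAYERS)}
    ratios = [0.2, 0.4, 0.6, 0.8]  # one ratio per block, not one ratio for everything
    out = merge_per_layer(sd_a, sd_b, ratios)
    print([out[f"model.layers.{i}.mlp.down_proj.weight"][0, 0].item() for i in range(NUM_LAYERS)])
```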
Darwin V4's Advance
Darwin V4 solved this with evolutionary algorithms – automatically searching hundreds of parameter combinations and selecting survivors by real benchmark scores.
You didn't invent that. See above.

You need more than this, my guy. You can't just expect us to take your word for it. Give some actual theory or get lost.
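Since the card won't give any theory, here is the entire conceptual content of "evolutionary merge search" in a screenful; the fitness function, population size, and mutation scale are placeholders you'd swap out for an actual merge plus actual evals:

```python
# Bare-bones evolutionary search over two merge ratios (attn, ffn).
# Everything here is a placeholder: a real setup would build the merged checkpoint
# and score it on real benchmarks instead of calling this toy fitness function.
import random

def fitness(attn_ratio: float, ffn_ratio: float) -> float:
    """Stand-in for 'merge with these ratios, then run your eval suite'."""
    return -((attn_ratio - 0.2) ** 2 + (ffn_ratio - 0.8) ** 2)  # toy optimum at (0.2, 0.8)

def evolve(generations: int = 30, population: int = 8, sigma: float = 0.1):
    pop = [(random.random(), random.random()) for _ in range(population)]
    for _ in range(generations):
        scored = sorted(pop, key=lambda p: fitness(*p), reverse=True)
        parents = scored[: population // 2]  # select survivors by score
        children = [
            (min(1.0, max(0.0, a + random.gauss(0, sigma))),
             min(1.0, max(0.0, f + random.gauss(0, sigma))))
            for a, f in parents  # mutate survivors
        ]
        pop = parents + children
    return max(pop, key=lambda p: fitness(*p))

if __name__ == "__main__":
    best_attn, best_ffn = evolve()
    print(f"best ratios found: attn={best_attn:.3f}, ffn={best_ffn:.3f}")
```

The expensive part has never been the search loop; it's the hundreds of merge-and-benchmark evaluations hiding behind that fitness function.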
Discovering attn=0.168 and ffn=0.841 – this extreme asymmetry – is virtually impossible by human intuition.
Perhaps not those precise numbers, but people already do layer-wise and module-wise merge ratios, and this is exactly what my friends in Allura have found in their finetuning experiments: changing attention vs. feed-forward layers produces drastically different results (a sketch of how trivially that's expressed follows). We're a bunch of gooner dorks in our bedrooms; you've rediscovered this as a government-funded AI lab. What's going on here, exactly?
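"Different ratios for attention vs. FFN" is literally a name-matching exercise. The parameter name patterns below assume a HF/Qwen-style layout, and the ratios are just the card's own numbers reused as an example:

```python
# Apply separate mixing ratios to attention vs. feed-forward parameters by matching
# parameter names. Name patterns assume a HF/Qwen-style layout ("self_attn", "mlp")
# and are illustrative only.
import torch

ATTN_RATIO = 0.168  # the card's own numbers, used purely as an example
FFN_RATIO = 0.841

def module_ratio(name: str) -> float:
    if "self_attn" in name:
        return ATTN_RATIO
    if "mlp" in name or "experts" in name:
        return FFN_RATIO
    return 0.5  # embeddings, norms, lm_head: pick whatever default you like

def merge_by_module(sd_a: dict, sd_b: dict) -> dict:
    return {name: (1 - module_ratio(name)) * a + module_ratio(name) * sd_b[name]
            for name, a in sd_a.items()}

if __name__ == "__main__":
    sd_a = {"model.layers.0.self_attn.q_proj.weight": torch.zeros(2, 2),
            "model.layers.0.mlp.gate_proj.weight": torch.zeros(2, 2)}
    sd_b = {k: torch.ones(2, 2) for k in sd_a}
    out = merge_by_module(sd_a, sd_b)
    print(out["model.layers.0.self_attn.q_proj.weight"][0, 0].item(),  # 0.168
          out["model.layers.0.mlp.gate_proj.weight"][0, 0].item())     # 0.841
```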

No rigorous definition of "dead" is provided anywhere in this model card. From what I can tell it just means "inactive to a higher degree."
MRI didn't apply uniform ratios. It split 40 layers into 3 blocks:
Thanks GPT-4o.

But again, these terms are meaningless. We don't know what "MRI" means, and we have no way to verify that your process actually produces the numbers you're reporting.
Dead Expert 50~65% is the fingerprint of Claude text-only distillation. The fine-tuning killed multimodal and multilingual experts that were no longer activated during text-only training.
Didn't you say at the top that the Claude distill is a text-only model?? Why would you expect experts wired to the multimodal tower to be activated during text-only training?? Are we for real??
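If you actually wanted "dead" to mean something, you'd define it as a routing-rate threshold and measure it. Here's my guess at a sane definition, not whatever "MRI" is supposed to be; the cutoff of 1% of uniform usage and the router tallying are assumptions you'd adapt to the actual MoE implementation:

```python
# One concrete way to define a "dead" expert: run a text corpus through the model,
# tally how often the router selects each expert, and flag experts whose selection
# rate falls below a fraction of the uniform-usage baseline.
import torch

def dead_expert_fraction(routing_counts: torch.Tensor, total_tokens: int,
                         top_k: int, threshold: float = 0.01) -> float:
    """routing_counts: [num_layers, num_experts] tally of router top-k selections."""
    expected_uniform = total_tokens * top_k / routing_counts.shape[1]
    rates = routing_counts / expected_uniform            # 1.0 == perfectly uniform usage
    return (rates < threshold).float().mean().item()     # fraction below the cutoff

if __name__ == "__main__":
    # Toy tally: 2 layers x 8 experts, a few experts never chosen (top_k=1 routing).
    counts = torch.tensor([[120., 0., 95., 0., 110., 130., 0., 105.],
                           [100., 90., 0., 115., 0., 125., 110., 20.]])
    frac = dead_expert_fraction(counts, total_tokens=560, top_k=1)
    print(f"dead expert fraction: {frac:.2%}")  # ~31% of experts never routed to
```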
Father MRI: Healthy Generalist (Organ Donor)
Yet another metaphor with no technical definition extant or provided.
The Father (Qwen3.5-35B-A3B) shows healthy, uniform expert activation across all 40 layers – a well-balanced generalist with all experts alive. This is the "organ donor" that revives the Mother's dead 50–65% experts.
Of course. It's the base model. You would expect that.
Why This Matters
Thanks GPT-4o.
I can't critique this section, but I don't think I have to, because it's unfalsifiable on account of the blatant and egregious lack of any kind of technical detail in this model card. There is nothing to critique. This is Ancient Aliens tier. This is a wall made of saltine crackers.
So what do we have here?
A layer-wise merge of a Claude 4.6 Opus distillation back onto the Qwen 3.5 base, which recovers some of the degradation caused by what might have been an underdeveloped finetuning methodology and therefore performs better, because model merging is a validated technique that works well. The layer-wise ratios were discovered with an evolutionary search, a thing that already exists but isn't done often because it's more expensive.
That alone is interesting enough to promote. It's good PR for evolutionary merging, which I think more people should be focusing on.
But what's stapled on top is a cheap facade of irrelevant medical jargon that communicates nothing of value about anything that might actually have changed in the process, along with false claims about merging that demonstrate nobody involved with this project respects it as a method, all in a model card shat out by a free-tier LLM that understands what it's saying perhaps even less than the humans who could conceivably have produced the graphs.
I am insulted to have spent my time reading this. There is so much more I could go into, but I'd just keep repeating myself over and over, and I only have so many hours in a day.
Come back with a paper with some actual math on it, and I'll change my tone. Until then, stay off of our HF feed, please. This crap makes us all look bad.
Get that government bag tho I guess.