From Machine Learning

[P] Visualizing LM's Architecture and data flow with Q subspace projection

Hey guys, I did something hella entertaining. With some black magic and voodoo I was able to extract pretty cool images from the model that look like an MRI. I'm not claiming anything; I have some hypotheses about it... I'm sharing it mostly because it is just so pretty and mind-boggling.

I stumbled upon a way to visualize the structure of an LM's structures in a 3D volume.

Here is the Gist link with a speedrun of the idea.

Some images:

y3i12/Prisma (my research model)

Qwen/Qwen3.5-0.8B

HuggingFaceTB/SmolLM-360M

RWKV/rwkv-4-430m-pile

state-spaces/mamba-370m-hf

Right now I'm looking for a place where I can upload the interactive HTML files. If you know of one, let me know and I'll link them. It is really mesmerizing to keep looking at them from different angles.

The mediator surface that comes out of this is also pretty interesting:

https://preview.redd.it/zbbvba1m9mqg1.png?width=749&format=png&auto=webp&s=48f2a44273bdba30176b89d8057c0e9880cb9401

I wonder if this is one of many possible interpretations of a "loss landscape".

submitted by /u/y3i12