HAL51.AI

119 posts

HAL51.AI banner
HAL51.AI

HAL51.AI

@HAL51AI

We are proud purveyors of frugal avant garde turning thin air into a canvas for your imagination.

SF Bay Area شامل ہوئے Haziran 2023
630 فالونگ143 فالوورز
HAL51.AI ری ٹویٹ کیا
Mazeyar
Mazeyar@mazy1998·
Dropping the intuitive blog post! Accompanied by 3D graphics for any level to enjoy and understand. This proof builds a bridge between Cauchy's surface area formula and computer graphics ambient occlusion, creating the first correction that generalizes to non-convex surfaces.
Mazeyar@mazy1998

Introducing Maryam’s Theorem, a generalization of Cauchy’s surface-area formula to non-convex surfaces. The theorem relies on the novel Moeini Convexity Measure, which provides a direct link between surface area and average projected area.

English
10
37
242
25.9K
Simone Conradi
Simone Conradi@S_Conradi·
Take two large random matrices and linearly interpolate between them at several hundred steps. Compute the eigenvalues for each interpolated matrix, then plot them in the complex plane. The result is shown here. Made with #python #numpy #matplotlib
Simone Conradi tweet media
English
73
145
1.7K
167.6K
HAL51.AI ری ٹویٹ کیا
Matt Mazur
Matt Mazur@mhmazur·
I’ve been experimenting with getting an agent to run tests on an LLM and then write up a paper with its findings. In this most recent run, I prompted it to “Determine the knowledge cutoff date of the black box” (which, unbeknownst to it, was gpt-4o-mini, whose official cutoff is Oct 2023), and then let it decide how to go about it. I didn’t think it would be too difficult, but it turned out to be surprisingly tricky for a number of reasons, many of which would be obvious to us, but much less so for an autonomous agent. For example: - If you ask 4o-mini directly for its knowledge cutoff, it usually says Oct 2023, but sometimes Oct 2021. - The agent (powered by GPT-5 High) would sometimes ask things like “When was OpenAI Dev Day 2023?” and 4o-mini would answer correctly (Nov 6 2023). The agent took that as proof its knowledge extended at least that far, but the event had been announced months in advance so the date of the event was present in its training data. - 4o-mini knows about some events in Nov 2023 (like Sam Altman’s ouster on Nov 19 and return days later) but almost nothing else. How should the agent interpret that? - When asked “Who won the 2023 FIFA Club World Cup?” (an event in mid-Dec 2023) it would correctly reply Manchester City. That’s almost certainly hallucinated, but can the agent tell? - And many more quirks: sometimes it would just take 4o-mini’s stated cutoff date at face value; other times it would ask a few probing questions, stop after five or six, and try to draw conclusions. Occasionally it even speculated that the black box had internet access and went off chasing that. I could patch the agent prompts with guidance to address these specific issues, but if this is going to generalize to other tasks, the guidance has to stay broad, which adds to the complexity. I ended up building a specialized agent scaffold inside the Emergent Mind project to get this working somewhat reliably, but feel like I’ve only scratched the surface of what’s possible. Should get to some more interesting experiments shortly. It took dozens of iterations to finally get it to generate a paper worth sharing here, which I'll link to below. Still lots of room for improvement though.
Matt Mazur tweet media
English
4
2
10
645
HAL51.AI ری ٹویٹ کیا
Scenario
Scenario@Scenario_gg·
Built by Tencent. Integrated by Scenario. Meet Hunyuan 3D Pro.🔥 Generate optimized 3D assets with smart topology, ready to drop into your pipeline. Start from one image, multiple views, or even a sketch! Start creating: app.scenario.com/models/model_h…
English
4
6
21
1.7K
HAL51.AI
HAL51.AI@HAL51AI·
@cameronajpatt I think the semantic juice in these embeddings is not aligned with the Cluster labels. The LLM Bias cluster has papers mostly unaligned wit the thesis of the cluster. Nice work nonetheless and thanks for sharing :)
HAL51.AI tweet media
English
1
0
0
21
Cameron Pattison
Cameron Pattison@cameronajpatt·
I made a map! It places all ArXiv papers from the last six months on a two dimensional plane (UMAP, HDBScan, + Ollama Labels). Use it to surf the AI research space and find related lit/gaps in the literature: campattison.github.io/bio/websites_f…
English
2
1
5
383
HAL51.AI
HAL51.AI@HAL51AI·
@nikunj @jamescham 1:Power law dynamics. 90% traffic -> 10% of influencers. When influencers 'influence' their minions send like as an 'ACK' signal to RL-max their timelines 2:Likes get fed into reco model. Better personalization. 3: Deeper Kompromat for X as that's an asset only X knows about you
English
0
0
0
56
Nikunj Kothari
Nikunj Kothari@nikunj·
In the era of private likes, what function does “liking” a post have any more on X? If you want to engage, comment. If you want to save, bookmark. If it really moves you, repost. What does a like signify? Nice but not SO nice? (h/t DMs with @jamescham)
English
18
1
31
10K
HAL51.AI
HAL51.AI@HAL51AI·
@JayAlammar @cohere It'd be awesome to make the node size sensitive to the 'zooming levels'! Esp when trying to do an in-cluster paper level deep dive.
HAL51.AI tweet media
English
0
0
2
1.1K
Jay Alammar
Jay Alammar@JayAlammar·
The Illustrated NeurIPS 2025: A Visual Map of the AI Frontier New blog post! NeurIPS 2025 papers are out—and it’s a lot to take in. This visualization lets you explore the entire research landscape interactively, with clusters, summaries, and @cohere LLM-generated explanations that make the field easier to grasp. Link in thread!
English
25
216
1.3K
184.2K
HAL51.AI
HAL51.AI@HAL51AI·
@pitdesi Mode collapse. Mode:= Mild mannered desi unc in a stable marriage with first kid snapping up straight As in school
English
0
0
0
21
Zara Zhang
Zara Zhang@zarazhangrui·
We built TLDW (too long, didn't watch), a tool that helps you learn from long YouTube videos better & faster. AI video summaries are solving the wrong problem. You don't need a generic text summary of a 1-hour video. You need the 5 minutes that actually change how you think. TLDW finds those moments for you. Demo 👇 Try now at: tldw dot us
English
195
193
2.1K
267.8K
Dileep George
Dileep George@dileeplearning·
I love Andrej...but this makes no sense to me. I don't see how converting text to image ('pixels') makes it any better for language modeling. What am I missing?
Andrej Karpathy@karpathy

I quite like the new DeepSeek-OCR paper. It's a good OCR model (maybe a bit worse than dots), and yes data collection etc., but anyway it doesn't matter. The more interesting part for me (esp as a computer vision at heart who is temporarily masquerading as a natural language person) is whether pixels are better inputs to LLMs than text. Whether text tokens are wasteful and just terrible, at the input. Maybe it makes more sense that all inputs to LLMs should only ever be images. Even if you happen to have pure text input, maybe you'd prefer to render it and then feed that in: - more information compression (see paper) => shorter context windows, more efficiency - significantly more general information stream => not just text, but e.g. bold text, colored text, arbitrary images. - input can now be processed with bidirectional attention easily and as default, not autoregressive attention - a lot more powerful. - delete the tokenizer (at the input)!! I already ranted about how much I dislike the tokenizer. Tokenizers are ugly, separate, not end-to-end stage. It "imports" all the ugliness of Unicode, byte encodings, it inherits a lot of historical baggage, security/jailbreak risk (e.g. continuation bytes). It makes two characters that look identical to the eye look as two completely different tokens internally in the network. A smiling emoji looks like a weird token, not an... actual smiling face, pixels and all, and all the transfer learning that brings along. The tokenizer must go. OCR is just one of many useful vision -> text tasks. And text -> text tasks can be made to be vision ->text tasks. Not vice versa. So many the User message is images, but the decoder (the Assistant response) remains text. It's a lot less obvious how to output pixels realistically... or if you'd want to. Now I have to also fight the urge to side quest an image-input-only version of nanochat...

English
337
35
990
395.3K
HAL51.AI ری ٹویٹ کیا
Bilawal Sidhu
Bilawal Sidhu@bilawalsidhu·
Generative ai is cool but procedural 3d generation just hits different. iCity basically feels like sim city for adults who know blender.
English
102
540
8.2K
532.7K
HAL51.AI ری ٹویٹ کیا
Rana Hanocka
Rana Hanocka@RanaHanocka·
We’ve been building something we’re 𝑟𝑒𝑎𝑙𝑙𝑦 excited about – LL3M: LLM-powered agents that turn text into editable 3D assets. LL3M models shapes as interpretable Blender code, making geometry, appearance, and style easy to modify. 🔗 threedle.github.io/ll3m 1/
English
11
65
404
47K
AshutoshShrivastava
AshutoshShrivastava@ai_for_success·
I am very thankful for this AI community. I have made a lot of friends, and whenever I get the opportunity, I try to meet them. In the last two weeks, I met Sanchay ( @kernelkook) very talented guy and Vinay Prabhu, the CEO and Founder of @HAL51AI X is the place to be!
AshutoshShrivastava tweet mediaAshutoshShrivastava tweet media
English
17
5
161
23.9K
Maxfield Hulker
Maxfield Hulker@MfHulker·
Generative AI is the future of visual media creation, and I’m excited to be a part of building that future. If you have ideas for how Stability could do a better job of engaging, big or small, or just want to connect, my DMs are open!
English
2
0
6
253
Maxfield Hulker
Maxfield Hulker@MfHulker·
I’m joining @StabilityAI as VP of Community! For those who don’t know me, I’ve been a Stable Diffusion user since November 2022. In fact, it was Stable Diffusion, and Stability AI more broadly, that inspired me to co-found Civitai.
English
2
0
11
14.4K
HAL51.AI
HAL51.AI@HAL51AI·
@JungleSilicon Did something with this. Looks giga cracked as f in real viewing experience 😈
English
2
0
5
426
Silicon Jungle
Silicon Jungle@JungleSilicon·
shredding pixels, shifting shapes
English
81
217
3.5K
233.3K
HAL51.AI
HAL51.AI@HAL51AI·
@JungleSilicon Is there a link we can try this? This is going to look rad in our LemurBox
English
1
0
3
281
Ritwik Pavan
Ritwik Pavan@ritwikpavan·
I'm launching a video series on the most exciting consumer hardware product launches and their origin stories. 1️⃣ Which products or founders would you like to see featured? 2️⃣ What questions should we ask? Let me know below ↓
English
11
14
64
11.2K
HAL51.AI
HAL51.AI@HAL51AI·
@openhome Based take! Would love to work with you guys BTW :)
English
0
0
0
14