HAL51.AI

119 posts

HAL51.AI

@HAL51AI

We are proud purveyors of frugal avant garde turning thin air into a canvas for your imagination.

SF Bay Area شامل ہوئے Haziran 2023

630 فالونگ143 فالوورز

HAL51.AI ری ٹویٹ کیا

Mazeyar@mazy1998·1 Oca

Dropping the intuitive blog post! Accompanied by 3D graphics for any level to enjoy and understand. This proof builds a bridge between Cauchy's surface area formula and computer graphics ambient occlusion, creating the first correction that generalizes to non-convex surfaces.

Mazeyar@mazy1998

Introducing Maryam’s Theorem, a generalization of Cauchy’s surface-area formula to non-convex surfaces. The theorem relies on the novel Moeini Convexity Measure, which provides a direct link between surface area and average projected area.

English

242

25.9K

HAL51.AI@HAL51AI·28 Ara

@AltenbergLee @S_Conradi I've seen this saddle before!

English

Lee Altenberg, Ph.D. 🇺🇸@AltenbergLee·26 Ara

@S_Conradi An obscure result deserving wider knowledge: Levinger's Theorem: Interpolate between a random nonnegative matrix and its transpose. Then the maximum spectral radius is at the midpoint. My own dive into Levinger's theorem, with Joel E. Cohen: sciencedirect.com/science/articl…

English

Simone Conradi@S_Conradi·25 Ara

Take two large random matrices and linearly interpolate between them at several hundred steps. Compute the eigenvalues for each interpolated matrix, then plot them in the complex plane. The result is shown here. Made with #python #numpy #matplotlib

English

145

1.7K

167.6K

HAL51.AI ری ٹویٹ کیا

Georgia Gkioxari@georgiagkioxari·19 Kas

3Dfy anything from a single image! Very thrilled to announce SAM 3D. From an input image, select any object you want, 3Dfy it! Blog: ai.meta.com/blog/sam-3d/ Demo: aidemos.meta.com/segment-anythi…

English

192

1.4K

165.3K

HAL51.AI ری ٹویٹ کیا

Matt Mazur@mhmazur·9 Eki

I’ve been experimenting with getting an agent to run tests on an LLM and then write up a paper with its findings. In this most recent run, I prompted it to “Determine the knowledge cutoff date of the black box” (which, unbeknownst to it, was gpt-4o-mini, whose official cutoff is Oct 2023), and then let it decide how to go about it. I didn’t think it would be too difficult, but it turned out to be surprisingly tricky for a number of reasons, many of which would be obvious to us, but much less so for an autonomous agent. For example: - If you ask 4o-mini directly for its knowledge cutoff, it usually says Oct 2023, but sometimes Oct 2021. - The agent (powered by GPT-5 High) would sometimes ask things like “When was OpenAI Dev Day 2023?” and 4o-mini would answer correctly (Nov 6 2023). The agent took that as proof its knowledge extended at least that far, but the event had been announced months in advance so the date of the event was present in its training data. - 4o-mini knows about some events in Nov 2023 (like Sam Altman’s ouster on Nov 19 and return days later) but almost nothing else. How should the agent interpret that? - When asked “Who won the 2023 FIFA Club World Cup?” (an event in mid-Dec 2023) it would correctly reply Manchester City. That’s almost certainly hallucinated, but can the agent tell? - And many more quirks: sometimes it would just take 4o-mini’s stated cutoff date at face value; other times it would ask a few probing questions, stop after five or six, and try to draw conclusions. Occasionally it even speculated that the black box had internet access and went off chasing that. I could patch the agent prompts with guidance to address these specific issues, but if this is going to generalize to other tasks, the guidance has to stay broad, which adds to the complexity. I ended up building a specialized agent scaffold inside the Emergent Mind project to get this working somewhat reliably, but feel like I’ve only scratched the surface of what’s possible. Should get to some more interesting experiments shortly. It took dozens of iterations to finally get it to generate a paper worth sharing here, which I'll link to below. Still lots of room for improvement though.

English

645

HAL51.AI ری ٹویٹ کیا

Jürgen Schmidhuber@SchmidhuberAI·15 Kas

@fchollet

QME

154

10.3K

HAL51.AI ری ٹویٹ کیا

Scenario@Scenario_gg·12 Kas

Built by Tencent. Integrated by Scenario. Meet Hunyuan 3D Pro.🔥 Generate optimized 3D assets with smart topology, ready to drop into your pipeline. Start from one image, multiple views, or even a sketch! Start creating: app.scenario.com/models/model_h…

English

1.7K

HAL51.AI@HAL51AI·10 Kas

@cameronajpatt I think the semantic juice in these embeddings is not aligned with the Cluster labels. The LLM Bias cluster has papers mostly unaligned wit the thesis of the cluster. Nice work nonetheless and thanks for sharing :)

English

Cameron Pattison@cameronajpatt·9 Kas

I made a map! It places all ArXiv papers from the last six months on a two dimensional plane (UMAP, HDBScan, + Ollama Labels). Use it to surf the AI research space and find related lit/gaps in the literature: campattison.github.io/bio/websites_f…

English

383

HAL51.AI@HAL51AI·3 Kas

@nikunj @jamescham 1:Power law dynamics. 90% traffic -> 10% of influencers. When influencers 'influence' their minions send like as an 'ACK' signal to RL-max their timelines 2:Likes get fed into reco model. Better personalization. 3: Deeper Kompromat for X as that's an asset only X knows about you

English

Nikunj Kothari@nikunj·3 Kas

In the era of private likes, what function does “liking” a post have any more on X? If you want to engage, comment. If you want to save, bookmark. If it really moves you, repost. What does a like signify? Nice but not SO nice? (h/t DMs with @jamescham)

English

10K

HAL51.AI@HAL51AI·3 Kas

@JayAlammar @cohere It'd be awesome to make the node size sensitive to the 'zooming levels'! Esp when trying to do an in-cluster paper level deep dive.

English

1.1K

Jay Alammar@JayAlammar·3 Kas

The Illustrated NeurIPS 2025: A Visual Map of the AI Frontier New blog post! NeurIPS 2025 papers are out—and it’s a lot to take in. This visualization lets you explore the entire research landscape interactively, with clusters, summaries, and @cohere LLM-generated explanations that make the field easier to grasp. Link in thread!

English

216

1.3K

184.2K

HAL51.AI@HAL51AI·22 Eki

@pitdesi Mode collapse. Mode:= Mild mannered desi unc in a stable marriage with first kid snapping up straight As in school

English

Sheel Mohnot@pitdesi·20 Eki

They make a cartoon avatar of you when you speak at Money2020. PFP is mine from 2017. It’s funny to me how many people hate it. I should probably update it but I think it’s good to have a consistent pfp Asked Chat to update and I don’t like how I look 😂

0x45@0x45o

can yall pls tell me the lore behind your profile pictures

English

25.8K

HAL51.AI@HAL51AI·22 Eki

@zarazhangrui :'(

Zara Zhang@zarazhangrui·20 Eki

We built TLDW (too long, didn't watch), a tool that helps you learn from long YouTube videos better & faster. AI video summaries are solving the wrong problem. You don't need a generic text summary of a 1-hour video. You need the 5 minutes that actually change how you think. TLDW finds those moments for you. Demo 👇 Try now at: tldw dot us

English

195

193

2.1K

267.8K

HAL51.AI@HAL51AI·22 Eki

@dileeplearning Isn't it all an empirical slugfest after all? The evidence seems to be stacking up. Was sensing the vibe shift in the recent ICML tokenizer workshop. youtube.com/watch?v=XkPdoE…

YouTube

English

Dileep George@dileeplearning·21 Eki

I love Andrej...but this makes no sense to me. I don't see how converting text to image ('pixels') makes it any better for language modeling. What am I missing?

Andrej Karpathy@karpathy

I quite like the new DeepSeek-OCR paper. It's a good OCR model (maybe a bit worse than dots), and yes data collection etc., but anyway it doesn't matter. The more interesting part for me (esp as a computer vision at heart who is temporarily masquerading as a natural language person) is whether pixels are better inputs to LLMs than text. Whether text tokens are wasteful and just terrible, at the input. Maybe it makes more sense that all inputs to LLMs should only ever be images. Even if you happen to have pure text input, maybe you'd prefer to render it and then feed that in: - more information compression (see paper) => shorter context windows, more efficiency - significantly more general information stream => not just text, but e.g. bold text, colored text, arbitrary images. - input can now be processed with bidirectional attention easily and as default, not autoregressive attention - a lot more powerful. - delete the tokenizer (at the input)!! I already ranted about how much I dislike the tokenizer. Tokenizers are ugly, separate, not end-to-end stage. It "imports" all the ugliness of Unicode, byte encodings, it inherits a lot of historical baggage, security/jailbreak risk (e.g. continuation bytes). It makes two characters that look identical to the eye look as two completely different tokens internally in the network. A smiling emoji looks like a weird token, not an... actual smiling face, pixels and all, and all the transfer learning that brings along. The tokenizer must go. OCR is just one of many useful vision -> text tasks. And text -> text tasks can be made to be vision ->text tasks. Not vice versa. So many the User message is images, but the decoder (the Assistant response) remains text. It's a lot less obvious how to output pixels realistically... or if you'd want to. Now I have to also fight the urge to side quest an image-input-only version of nanochat...

English

337

990

395.3K

HAL51.AI ری ٹویٹ کیا

Bilawal Sidhu@bilawalsidhu·25 Eyl

Generative ai is cool but procedural 3d generation just hits different. iCity basically feels like sim city for adults who know blender.

English

102

540

8.2K

532.7K

HAL51.AI ری ٹویٹ کیا

Rana Hanocka@RanaHanocka·15 Ağu

We’ve been building something we’re 𝑟𝑒𝑎𝑙𝑙𝑦 excited about – LL3M: LLM-powered agents that turn text into editable 3D assets. LL3M models shapes as interpretable Blender code, making geometry, appearance, and style easy to modify. 🔗 threedle.github.io/ll3m 1/

English

404

47K

HAL51.AI@HAL51AI·9 Mar

@ai_for_success @kernelkook Feels very lucky to have met you Ashu!

English

AshutoshShrivastava@ai_for_success·8 Mar

I am very thankful for this AI community. I have made a lot of friends, and whenever I get the opportunity, I try to meet them. In the last two weeks, I met Sanchay ( @kernelkook) very talented guy and Vinay Prabhu, the CEO and Founder of @HAL51AI X is the place to be!

English

161

23.9K

HAL51.AI@HAL51AI·4 Şub

@MfHulker DMs seem to be turned off

English

Maxfield Hulker@MfHulker·4 Şub

Generative AI is the future of visual media creation, and I’m excited to be a part of building that future. If you have ideas for how Stability could do a better job of engaging, big or small, or just want to connect, my DMs are open!

English

253

Maxfield Hulker@MfHulker·4 Şub

I’m joining @StabilityAI as VP of Community! For those who don’t know me, I’ve been a Stable Diffusion user since November 2022. In fact, it was Stable Diffusion, and Stability AI more broadly, that inspired me to co-found Civitai.

English

14.4K

HAL51.AI@HAL51AI·2 Şub

@JungleSilicon Did something with this. Looks giga cracked as f in real viewing experience 😈

English

426

Silicon Jungle@JungleSilicon·2 Şub

shredding pixels, shifting shapes

English

217

3.5K

233.3K

HAL51.AI@HAL51AI·2 Şub

@JungleSilicon Is there a link we can try this? This is going to look rad in our LemurBox

English

281

HAL51.AI@HAL51AI·31 Oca

@ritwikpavan DM'ed!✋

English

Ritwik Pavan@ritwikpavan·31 Oca

I'm launching a video series on the most exciting consumer hardware product launches and their origin stories. 1️⃣ Which products or founders would you like to see featured? 2️⃣ What questions should we ask? Let me know below ↓

English

11.2K

HAL51.AI@HAL51AI·28 Oca

@openhome Based take! Would love to work with you guys BTW :)

English

دریافت کریں

@AltenbergLee @S_Conradi @fchollet @cameronajpatt @nikunj @jamescham @JayAlammar @cohere