anish chhaparwal

23 posts

anish chhaparwal

anish chhaparwal

@anishchhaparwal

Katılım Haziran 2012
607 Takip Edilen21 Takipçiler
anish chhaparwal retweetledi
Google DeepMind
Google DeepMind@GoogleDeepMind·
Introducing Genie 2: our AI model that can create an endless variety of playable 3D worlds - all from a single image. 🖼️ These types of large-scale foundation world models could enable future agents to be trained and evaluated in an endless number of virtual environments. → goo.gle/4f48y2e
English
342
1.2K
6.4K
1.4M
Jeremy Howard
Jeremy Howard@jeremyphoward·
For folks that don't use conda, how do you debug problems that only occur on some particular Python or CUDA version? With conda, I have a few envs set up with various python and CUDA versions, so I just activate one and get to work. With something like uv, what do you do?
English
68
13
384
81.8K
anish chhaparwal retweetledi
Bindu Reddy
Bindu Reddy@bindureddy·
And Just Like That - Tencent just dropped an excellent open-source video model! IT LOOKS AWESOME The final gap that we had in open-source AI - namely a video-gen model - is now closed 😀 AS USUAL, CHINA RULES
English
19
36
215
16.7K
anish chhaparwal retweetledi
World Labs
World Labs@theworldlabs·
Most generative models predict pixels. Predicting a 3D scene instead has many benefits: the scene won’t change if you look away and come back, and it obeys the basic physical rules of 3D geometry. The simplest way to visualize the 3D scene is a depth map, where each pixel is colored by its distance to the camera. 4/n
English
2
8
139
16.6K
anish chhaparwal
anish chhaparwal@anishchhaparwal·
@alexalbert__ @alexalbert__ you guys can start an official api repository mpc that can provide Claude with context via /llms.txt for popular standard libraries.
English
0
0
0
139
Alex Albert
Alex Albert@alexalbert__·
It's been great to see the community's response to MCP! This launch is just the starting point - the team is already working on fast-followup improvements to the docs and Claude Desktop app experience based on your initial feedback. MCP's true potential will unfold in the coming months, but we wanted to get it into devs' hands early so you could help shape its future. Your critiques, suggestions, and use cases are incredibly valuable in ensuring MCP becomes a protocol that works for everyone. Keep the feedback coming!
English
32
12
351
31.4K
anish chhaparwal retweetledi
Vaibhav (VB) Srivastav
Vaibhav (VB) Srivastav@reach_vb·
OH WOW! The Whale aka @deepseek_ai is BACK!! New model, with complete reasoning outputs and a gracious FREE TIER too! 🔥 Here's a quick snippet of it searching the web for the right documentation, creating the JS files plus the necessary HTML all whilst handling Auth too ⚡ I really hope they Open release the model checkpoints too!
English
7
44
277
27.8K
kache
kache@yacineMTB·
decorators are an antipattern and should be avoided
kache tweet media
English
234
93
2.6K
270.9K
anish chhaparwal retweetledi
Denny Zhou
Denny Zhou@denny_zhou·
What is the performance limit when scaling LLM inference? Sky's the limit. We have mathematically proven that transformers can solve any problem, provided they are allowed to generate as many intermediate reasoning tokens as needed. Remarkably, constant depth is sufficient. arxiv.org/abs/2402.12875 (ICLR 2024)
Denny Zhou tweet media
English
107
511
3K
795K
Jeremy Howard
Jeremy Howard@jeremyphoward·
Anyone know of a MacOS or Windows app that does real time voice recognition (i.e hold down a hotkey, and your speech is converted to text and placed into the active text area), BUT can work with a custom model? I'd like to fine-tune whisper for my voice and use it with my PCs.
English
65
6
351
88.2K
Tom Dörr
Tom Dörr@tom_doerr·
As context windows get longer, should we just add the complete repo for coding?
Tom Dörr tweet media
English
6
5
45
4.3K
Tom Dörr
Tom Dörr@tom_doerr·
I feel bad for people applying to jobs on LinkedIn themselves
Tom Dörr tweet media
English
2
2
24
2.3K
Rohan Paul
Rohan Paul@rohanpaul_ai·
Quite incredible setup for running your own AI cluster at home with everyday devices. @exolabs_ and @__tinygrad__ backend. The key is that exo optimally splits up models based on the current network topology and device resources available. See the code in image shows a partition strategy. 👉 Topology: So, the `Topology` object represents the entire network of available devices, their capabilities, and their connections. It's a flexible representation of how the devices in the network are connected and what their capabilities are. The partitioning strategy uses this to make decisions about how to divide the model across the available devices. To make sense of the "Ring" in the name, you can think of it like devices are conceptually arranged in a ring, where each device is connected to two neighbors. This allows for efficient communication between partitions. 👉 Memory Weighted: The strategy weighs each device's contribution based on its available memory. Devices with more memory get larger portions of the model. 👉 Here's what the code actually does: - It sorts the nodes primarily by memory capacity (in descending order) and secondarily by node ID. - It then allocates partitions to these nodes based on their proportional memory capacity. - The partitions are represented as continuous ranges from 0 to 1
Rohan Paul tweet media
Alex Cheema@alexocheema

Connected my MacBook Pro GPU to my Linux laptop NVIDIA GPU using @exolabs_ Running really large AI models with 193 TFLOPs of compute, combining the M3 GPU + RTX 4090

English
6
21
221
33.6K
Pratham
Pratham@Prathkum·
I want to collaborate with someone with whom I can build a SaaS and run it for at least a couple of months to generate some revenue. Drop a 👋 if you are interested.
English
171
8
324
48.1K
Alex Albert
Alex Albert@alexalbert__·
We've got some exciting things coming up in the pipeline but we want to ship even more features that people want. What do you wish we added/fixed on claude dot ai or the API?
English
386
21
496
73K
anish chhaparwal
anish chhaparwal@anishchhaparwal·
@svpino @riverarodrigoa @hridayakuhara I think people will read your content if you tell them it’s promoted in the first line too. At least for the first few times. The quality of products you promote will determine if they continue to do so. Not a fan of dubiously being subjected to promotion.
English
1
0
1
88
Santiago
Santiago@svpino·
Will you read the content if I tell you in the first line is promoted? I’m sure the answer is no. You don’t trust the kind of selection I make, so you won’t have any reason to trust it will be good. Therefore, that’s the trade off. I make the selection, you read and decide for yourself. If you don’t like what I bring, you can ignore it. I think it’s a fair trade.
English
1
0
0
166
Santiago
Santiago@svpino·
If something could kill Jupyter notebooks, this would be it. Zerve is a web-based IDE. It's block-based, like a notebook, but it follows a different philosophy, and the experience is very different. And they just released a killer feature! You can now drag and drop any model from HuggingFace, a dataset, and fine-tune it using your infrastructure and GPUs. No privacy concerns, no data leaks, and no need to manage any infrastructure. Fine-tune your model with your data and host it in your environment without leaving the tool. A lot is happening in the attached screenshot, but that's an end-to-end process to fine-tune Gemma 2B. This is huge, but that's not all: 1. Zerve is web-based, so you can access your work anywhere 2. Collaboration is a first-class citizen 3. You can deploy your code directly 4. It promotes code modularity by using blocks 5. You can use multiple languages together 6. You can run code blocks in parallel Zerve is new, but it has the potential to change how data scientists and machine learning practitioners write code. I'm rooting for them. The tool is free. You can start right now by visiting. Thanks to their team for collaborating with me on this post.
Santiago tweet media
English
26
136
1.1K
158.5K
anish chhaparwal retweetledi
Sasank Chilamkurthy
Sasank Chilamkurthy@sasank51·
What if you could send preliminary AI radiology report to the referring physicians within minutes of CT scan being done? That's exactly what Dr. Arunkumar and his team at @AarthiScans are doing in this partnership with us! Ground breaking stuff!
Qure.ai@qure_ai

@AarthiScans is the first Pan-India Diagnostic Chain to provide #AI powered triaging CT Reports in all their 36 radiology branches.

English
0
3
11
0