Ronghang Hu

29 posts

Ronghang Hu @RonghangHu

multimodal at @xAI | previously at Meta FAIR

Palo Alto, CA · Joined August 2014
2.1K Following · 1.1K Followers
Ronghang Hu retweeted
Pengchuan Zhang @PengchuanZ
SAM 3.1 is here. It's a significant improvement in video grounding, shifting from SAM2-like per-object propagation to MOT-style multi-object propagation. This change significantly reduces computational cost while maintaining tracking accuracy!!!
AI at Meta @AIatMeta

We’re releasing SAM 3.1: a drop-in update to SAM 3 that introduces object multiplexing to significantly improve video processing efficiency without sacrificing accuracy. We’re sharing this update with the community to help make high-performance applications feasible on smaller, more accessible hardware. 🔗 Model Checkpoint: go.meta.me/8dd321 🔗 Codebase: go.meta.me/b0a9fb

3 replies · 7 reposts · 40 likes · 6.2K views
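To make the efficiency point above concrete: in a SAM2-style design the expensive per-frame feature extraction is effectively repeated for every tracked object, while MOT-style multiplexing encodes each frame once and updates all objects from the shared features. The toy sketch below is NOT the SAM 3.1 implementation (all function names are hypothetical); it just illustrates why the outputs match while the backbone runs N-objects times less often.

```python
# Toy illustration only, not SAM 3.1 code. It contrasts per-object
# propagation (backbone rerun for each object) with MOT-style
# multi-object propagation (backbone run once per frame, shared by all).
import numpy as np

def encode_frame(frame):
    """Stand-in for the heavy per-frame backbone (the dominant cost)."""
    return frame.mean(axis=-1)

def propagate_per_object(frames, object_ids):
    tracks = {obj: [] for obj in object_ids}
    for obj in object_ids:
        for frame in frames:
            feats = encode_frame(frame)  # recomputed for every object
            tracks[obj].append(float(feats[obj % feats.shape[0]].sum()))
    return tracks

def propagate_multi_object(frames, object_ids):
    tracks = {obj: [] for obj in object_ids}
    for frame in frames:
        feats = encode_frame(frame)      # computed once, shared by all
        for obj in object_ids:
            tracks[obj].append(float(feats[obj % feats.shape[0]].sum()))
    return tracks

frames = [np.random.rand(8, 8, 3) for _ in range(4)]
object_ids = [0, 1, 2]
# Identical tracks, but the backbone ran 4 times instead of 12.
assert propagate_per_object(frames, object_ids) == propagate_multi_object(frames, object_ids)
```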
Ronghang Hu retweeted
Xudong Lin @Xudong_Lin_AI
Proud of our team for making this huge leap over the last version, but this is just the start. Better models are lined up and we keep improving every week. Join us on the road toward Superhuman Multimodal Intelligence: job-boards.greenhouse.io/xai/jobs/50826… !!
Arena.ai @arena

Grok 4.20 Beta Reasoning makes @xAI a top 5 lab in Vision Arena. Scoring 1240, this model ranks #11 across all Vision models today. Congrats to the @xAI team for this milestone!

13 replies · 33 reposts · 276 likes · 60.5K views
Ronghang Hu retweeted
xAI @xai
Understanding requires imagining. Grok Imagine lets you bring what’s in your brain to life, and now it’s available via the world’s fastest and most powerful video API: x.ai/news/grok-imag… Try it out and let your imagination run wild.
522 replies · 672 reposts · 4.4K likes · 6.8M views
Ronghang Hu retweeted
Nikhila Ravi @nikhilaravi
One of the most highly requested features since we launched SAM 1 was the ability to prompt with text! @kate_saenko_ from the SAM 3 team explains how we built an efficient data engine to collect high-quality mask + text label annotations at scale, and introduces our new open-vocabulary benchmark, Segment Anything with Concepts (SA-Co).
AI at Meta @AIatMeta

Collecting a high-quality dataset with 4M unique phrases and 52M corresponding object masks helped SAM 3 achieve 2x the performance of baseline models. Kate, a researcher on SAM 3, explains how the data engine made this leap possible. 🔗 Read the SAM 3 research paper: go.meta.me/6411f7

0 replies · 4 reposts · 38 likes · 6.2K views
Ronghang Hu retweeted
Niels Rogge @NielsRogge
SAM-3 is out on @huggingface! A big upgrade from SAM-2, and Meta finally added support for text prompts. Here I tried it out on @hazardeden10's magical goal against @Arsenal using the text prompt "Chelsea player". Works pretty well!
8 replies · 34 reposts · 345 likes · 32K views
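For anyone who wants to reproduce this kind of experiment, here is a minimal sketch of text-prompted segmentation with the Hugging Face release. The class names (Sam3Processor/Sam3Model), the checkpoint id, and the post-processing step are assumptions following the usual transformers conventions, not a confirmed API; check the model card on @huggingface for the exact interface.

```python
# Hedged sketch: class names, checkpoint id, and post-processing are
# ASSUMPTIONS based on typical transformers conventions; consult the
# SAM 3 model card for the actual API.
import torch
from PIL import Image
from transformers import Sam3Processor, Sam3Model  # assumed class names

model_id = "facebook/sam3"  # hypothetical checkpoint id
processor = Sam3Processor.from_pretrained(model_id)
model = Sam3Model.from_pretrained(model_id)

image = Image.open("match_frame.jpg").convert("RGB")
# Open-vocabulary text prompt: segment every instance of the concept.
inputs = processor(images=image, text="Chelsea player", return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)
# Turning outputs into per-instance masks depends on the released API;
# the processor typically exposes a post-processing helper for this.
```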
Ronghang Hu retweeted
Alexandr Wang @alexandr_wang
Today we are releasing & open-sourcing Segment Anything 3 (SAM 3). It is a state-of-the-art model for image & video segmentation, and builds upon the work of SAM & SAM 2. SAM 3 will also power features in Edits, Meta AI, & Facebook Marketplace soon. aidemos.meta.com/segment-anythi…
189 replies · 205 reposts · 2.7K likes · 295.5K views
Ronghang Hu retweeted
AI at Meta @AIatMeta
Meet SAM 3, a unified model that enables detection, segmentation, and tracking of objects across images and videos. SAM 3 introduces some of our most highly requested features like text and exemplar prompts to segment all objects of a target category. Learnings from SAM 3 will help power new features in Instagram Edits and Vibes, bringing advanced segmentation capabilities directly to creators. 🔗 Learn more: go.meta.me/591040
26 replies · 144 reposts · 953 likes · 187.1K views
Ronghang Hu retweeted
Nikhila Ravi @nikhilaravi
🚀 Excited to announce new SAM 2.1 model checkpoints & the SAM 2 Developer Suite: 🤖 We’re releasing full training/fine-tuning code for SAM 2 so you can customize it for your use case. 💻 For the first time, we’re publishing the frontend & backend code for our SAM 2 web demo!
AI at Meta @AIatMeta

We’re on the ground at #ECCV2024 in Milan this week to showcase some of our latest research, new research artifacts and more. Here are 4️⃣ things you won’t want to miss from Meta FAIR, GenAI and Reality Labs Research this week, whether you’re here in person or following from your feed.
1️⃣ We’re releasing SAM 2.1, an upgraded version of the Segment Anything Model 2, and the SAM 2 Developer Suite featuring open source tools for training, inference and demos. New artifacts are live in the repo on GitHub ➡️ go.fb.me/mk6ofh
2️⃣ We’re supporting 10+ presentations and workshops in areas like computer vision for smart glasses and the metaverse, 3D vision for eCommerce, egocentric research with Project Aria and more.
3️⃣ We’re presenting seven orals at ECCV, in addition to the 50+ publications from researchers at Meta that were accepted for this year’s conference. Look out for more details on some of these papers later this week.
4️⃣ Demos and discussions with Meta researchers at our booth all week. Come by to discuss projects like SAM 2, Ego-Exo4D, DINOv2 and more.

5 replies · 23 reposts · 216 likes · 24.1K views
Ronghang Hu @RonghangHu
Here's what's included in the SAM 2.1 Developer Suite:
- A new suite of improved model checkpoints (denoted SAM 2.1).
- The training (and fine-tuning) code.
- The frontend + backend code for the SAM 2 web demo.
0 replies · 1 repost · 5 likes · 347 views
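For reference, a minimal image-prediction sketch against the released codebase is below, following the usage documented in the repo README. The checkpoint/config paths and the input image are example placeholders from the SAM 2.1 release.

```python
# Minimal sketch following the usage documented in the SAM 2 repo README.
# Checkpoint/config paths and the image file are example placeholders.
import numpy as np
import torch
from PIL import Image
from sam2.build_sam import build_sam2
from sam2.sam2_image_predictor import SAM2ImagePredictor

checkpoint = "./checkpoints/sam2.1_hiera_large.pt"  # SAM 2.1 weights
model_cfg = "configs/sam2.1/sam2.1_hiera_l.yaml"    # matching config
predictor = SAM2ImagePredictor(build_sam2(model_cfg, checkpoint))

image = np.array(Image.open("example.jpg").convert("RGB"))  # HxWx3 uint8
with torch.inference_mode(), torch.autocast("cuda", dtype=torch.bfloat16):
    predictor.set_image(image)
    # Prompt with a single positive point click at (x=500, y=375).
    masks, scores, _ = predictor.predict(
        point_coords=np.array([[500, 375]]),
        point_labels=np.array([1]),
    )
```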
Ronghang Hu @RonghangHu
SAM 2.1 Developer Suite (new checkpoints, training code, web demo) is released -- check it out at github.com/facebookresear…
AI at Meta @AIatMeta

[Quoted tweet: the same AI at Meta #ECCV2024 announcement shown above.]

1 reply · 5 reposts · 11 likes · 2.1K views
Ronghang Hu retweeted
AI at Meta @AIatMeta
Today, we’re sharing a roundup of Meta AI’s recent cutting-edge multimodal research, which we believe will collectively lead to more interactive, immersive, and smarter AI systems of the future: ai.facebook.com/blog/advances-…
6 replies · 52 reposts · 234 likes
Ronghang Hu retweeted
Deepak Pathak @pathak2206
We are presenting Worldsheet at #ICCV2021 this week as an Oral. Join the Q&A Wed & Fri. We updated the arXiv since v1: *multi-layered* Worldsheets autonomously handle sharp depth discontinuities/occlusions which a single sheet may fail to capture (Sec 3.5): worldsheet.github.io/resources/worl…
Deepak Pathak @pathak2206

Excited to share Worldsheet, a method to synthesize novel views with large camera changes from a *single* image. Turns out, simply shrink-wrapping a mesh sheet onto the image captures 3D well enough to render photorealistic far-away views. w/ @RonghangHu worldsheet.github.io

0 replies · 4 reposts · 26 likes
Ronghang Hu retweeted
Xin Eric Wang @xwang_lk
The @NAACLHLT ALVR 2021 workshop is happening tomorrow (June 11th) at 8:30am PDT!!! We have an amazing lineup of speakers talking about recent advances in language and vision research! Talk information is also provided in the program. alvr-workshop.github.io
Daniel Fried @dan_fried

Looking forward to the lineup of talks, papers, and panel discussions at tomorrow (Friday)'s ALVR workshop on language grounding (images, video, embodied control...): Program: alvr-workshop.github.io/#program Papers: alvr-workshop.github.io/#accepted-pape… Zoom link: underline.io/events/122/ses… #NAACL2021

1 reply · 3 reposts · 25 likes
Ronghang Hu retweeted
Wojciech Galuba @wgaluba
Facebook AI Research is looking for Research Engineers to join our Embodied AI team in Menlo Park. More about what we do in the thread below. If this sounds exciting, DM me or apply here: facebook.com/careers/v2/job… 1/6
2 replies · 14 reposts · 58 likes