Siddharth Sharma

989 posts

Siddharth Sharma banner
Siddharth Sharma

Siddharth Sharma

@siddrrsh

CS @ Stanford. Building @mlfoundry. Prev @AWS, @Lux_Capital, @UniofOxford

Katılım Temmuz 2020
2.7K Takip Edilen2.8K Takipçiler
Sabitlenmiş Tweet
Siddharth Sharma
Siddharth Sharma@siddrrsh·
Introducing ambientGPT: an open-source and multimodal MacOS foundation model GUI Run GPT-4o and open-source models with full ambient knowledge of your screen. Foundation models have long been confined to the browser. With ambientGPT, your screen context is directly inferred as part of the query, ensuring you never need to explicitly upload context again! Github: github.com/siddrrsh/ambie…
English
31
83
574
192.5K
Siddharth Sharma
Siddharth Sharma@siddrrsh·
Re Llama3V: Firstly, we want to apologize to the original authors of MiniCPM. @AkshGarg03 and I posted Llama3V with @mustafaaljadery. Mustafa wrote the code for the project. Aksh and I were both excited about multimodal models and liked the architectural extensions on top of Idefics and Siglip that he described to us. Our role here was to help him promote the model on medium and twitter. We looked at recent papers to validate the novelty of the work but we were not informed of or made aware of any of the previous work by @OpenBMB. After seeing the twitter posts about this topic yesterday, we asked Mustafa about proof of originality for Llama3V and asked for the training code but we haven’t seen any response so far. We were waiting for Mustafa to take the lead but instead we are releasing our own statement. We apologize to the authors of miniCBM for any inconvenience that we caused for not doing the full diligence to verify and peer review the novelty of this work. Going forward, we will be cautious and diligent, and we sincerely thank the community for bringing this to our attention. We've taken all references to Llama3V down and we apologize once again for the inconvenience we may have caused. - Siddharth and Aksh
PrimerYang@yangzhizheng1

Shocked! Llama3-V project from a Stanford team plagiarized a lot from MiniCPM-Llama3-V 2.5! its code is a reformatting of MiniCPM-Llama3-V 2.5, and the model's behavior is highly similar to a noised version of MiniCPM-Llama3-V 2.5 checkpoint. Evidence: github.com/OpenBMB/MiniCP…

English
44
41
267
525.4K
Siddharth Sharma retweetledi
Sakana AI
Sakana AI@SakanaAILabs·
Sakana AI is proud to sponsor the LLM Merging Competition: Building LLMs Efficiently through Merging at #NeurIPS2024 🤗 If you’re excited about pushing the frontiers of model merging, please visit: llm-merging.github.io
Sakana AI tweet media
English
8
59
327
127.2K
Siddharth Sharma
Siddharth Sharma@siddrrsh·
Great comeback from Sinner. The French crowd can’t stop this guy 💪
English
2
0
9
8.4K
Siddharth Sharma
Siddharth Sharma@siddrrsh·
Will be an awesome event!
Jared Quincy Davis@jaredq_

Updates w.r.t. the upcoming Compound AI Systems Workshop (June 13th in San Francisco): Accepted Posters We are excited to announce the accepted posters for the Compound AI Systems workshop. Due to space constraints, we were only able to accept 28 featured posters. Sincere congratulations to the authors of the accepted works 🎉. Thank you to all who submitted -- we look forward to hopefully featuring many of your works in future events. Please see the site for the full list of accepted papers: sites.google.com/view/compound-…! Speakers We are excited to announce the full slate of speakers: @RichardSocher . Founder and CEO of You.com and NLP pioneer. @MonicaSLam. Professor of CS at Stanford and director of OVAL lab. @polynoamial . Research Scientist at OpenAI and pioneer in multi-step reasoning, self-play, and multi-agent AI. @lmthang. Researcher at DeepMind, and PI for AlphaGeometry. @ysu_nlp. Faculty at The Ohio State University and PI for many exciting recent works on web agents and beyond. @hwchase17. creator of @langchain, a leading OSS developer framework for building context-aware reasoning apps. @thismattbell. Head of Applied Research at Anthropic, building systems to support agentic workflows. @YejinChoinka. Professor of CS at UW, leading PI at the intersection of NLP and reasoning. @HannaHajishirzi. Professor of CS at UW, focus on multi-hop reasoning, symbolic methods, and much more. @maithra_raghu. Founder and CEO of Samaya AI, an AI-powered knowledge-discovery platform. The Industry + Academia panel will be moderated by @matei_zaharia, who sits uniquely at that intersection as CTO at @Databricks and CS prof at @UCBerkeley. Registration Also, general registration is now open. There are only ~200 seats so register at your earliest convenience. Registration is free (see instructions on the website). Note that we have reserved spots for up to 2 authors for each accepted poster, so authors can forgo general registration.

English
0
0
1
9.3K
Siddharth Sharma
Siddharth Sharma@siddrrsh·
When he's in the zone, Carlos Alcaraz is the best player in the world. The defense-to-offense and style of attacking tennis is just brutal.
English
1
0
4
4.5K
Cory Levy
Cory Levy@cory·
any recs for a space/venue in san francisco that can host 500+ people?
English
15
0
51
15.6K
Siddharth Sharma
Siddharth Sharma@siddrrsh·
ML research is the most "empirical" science to ever exist. we test things and when they work or look promising, we double click ... so a lot of major results are child nodes of a tree of trial and error
English
2
1
28
5.1K
Siddharth Sharma
Siddharth Sharma@siddrrsh·
We’d like to thank the folks at Meta for their work in ensuring open-source is here to stay. We also wanted to shout out the authors of LLaVA-UHD as our methods are directly inspired by their intuition when it comes to image splitting and prepending the latents to the text. @astonzhangAZ @joespeez @ylecun @imhaotian @lukede0
English
0
0
35
7.8K
Siddharth Sharma retweetledi
adammaj
adammaj@MajmudarAdam·
I've spent the past ~3 weeks going through the entire history of deep learning and reimplementing all the core breakthroughs. It has completely changed my beliefs about deep learning progress and where we're headed. Progress tracker in thread (all resources at the end) 👇
adammaj tweet media
English
54
363
2.8K
635.2K
Siddharth Sharma retweetledi
Anthropic
Anthropic@AnthropicAI·
This week, we showed how altering internal "features" in our AI, Claude, could change its behavior. We found a feature that can make Claude focus intensely on the Golden Gate Bridge. Now, for a limited time, you can chat with Golden Gate Claude: claude.ai
Anthropic tweet media
English
104
243
1.7K
907.8K
James Zhou
James Zhou@jameszhou02·
Didn’t know we were in elite company 🫡 @Grindr
James Zhou tweet media
English
1
0
4
700
Siddharth Sharma
Siddharth Sharma@siddrrsh·
ambientGPT is open-source and we plan to integrate vllm and ollama to provide more extensive inference hosting abilities with our multimodal GUI. We also aim to release ambientGPT on the apple app store soon.
Siddharth Sharma tweet media
English
2
1
27
5.7K
Siddharth Sharma
Siddharth Sharma@siddrrsh·
Introducing ambientGPT: an open-source and multimodal MacOS foundation model GUI Run GPT-4o and open-source models with full ambient knowledge of your screen. Foundation models have long been confined to the browser. With ambientGPT, your screen context is directly inferred as part of the query, ensuring you never need to explicitly upload context again! Github: github.com/siddrrsh/ambie…
English
31
83
574
192.5K