Matt Sims

89 posts

Matt Sims banner
Matt Sims

Matt Sims

@mbwsims

Founder at Better Engines. Previously: Head of Applied AI at Spotter, Machine Learning at Sudowrite, Visiting Scientist at Netflix, and Postdoc at UC Berkeley.

Oakland, CA Beigetreten Eylül 2015
751 Folgt388 Follower
Angehefteter Tweet
Matt Sims
Matt Sims@mbwsims·
Teach Claude Code to think systematically. I got tired of having the same conversation with Claude Code. Review this for security. Are these tests sufficient? Can you find patterns in my codebase and update the instruction files? The answers were ok but inconsistent: no clear methodology, no memory between sessions, no systematic depth. So I built one Claude Code plugin, then another. Before I knew it I had five, covering instruction files, test coverage, security, codebase analysis, and code evolution. I decided to merge them into one integrated plugin. Claude universe was available so I figured why not… The Claude Universe plugin: teach Claude Code to think systematically Github link (entirely open source): github.com/mbwsims/claude… More at claudeuniverse.com
English
3
8
70
18.3K
Matt Sims retweetet
Paul Bakaus
Paul Bakaus@pbakaus·
Design inside your codebase. Introducing Impeccable 3.0: ▸ 1 skill, self-contained, 23 commands ▸ /impeccable live: pick in-browser, get prod-grade variants, accept writes to *source* ▸ reads+writes DESIGN.md + PRODUCT.md ▸ brand & product design impeccable.style
English
36
28
711
53.3K
Matt Sims
Matt Sims@mbwsims·
It took 8 years from this post (thanks to some major advancements in LLMs) for this to become easy to implement. But the bigger takeaway is that there's a ton of theoretical concepts in computing and technology from the last century that are now almost effortless to build. We should all be doing a better job of reading those archives. They're full of some pretty incredible ideas.
Andrej Karpathy@karpathy

"As We May Think" Vannevar Bush in 1945 trying to predict future theatlantic.com/amp/article/30… "A memex is a device in which an individual stores all his books, records, and communications, [...] it may be consulted with exceeding speed and flexibility. [...] supplement to his memory."

English
1
0
2
223
Matt Sims retweetet
Peter Hollens
Peter Hollens@PeterHollens·
@mbwsims MY DUDE!!!! LET'S CHAT SOON! WOULD LOVE TO BREAK YOUR STUFF! ;) haha i'll text ya
English
1
0
4
1.1K
Matt Sims
Matt Sims@mbwsims·
Teach Claude Code to think systematically. I got tired of having the same conversation with Claude Code. Review this for security. Are these tests sufficient? Can you find patterns in my codebase and update the instruction files? The answers were ok but inconsistent: no clear methodology, no memory between sessions, no systematic depth. So I built one Claude Code plugin, then another. Before I knew it I had five, covering instruction files, test coverage, security, codebase analysis, and code evolution. I decided to merge them into one integrated plugin. Claude universe was available so I figured why not… The Claude Universe plugin: teach Claude Code to think systematically Github link (entirely open source): github.com/mbwsims/claude… More at claudeuniverse.com
English
3
8
70
18.3K
Matt Sims retweetet
Paul Bakaus
Paul Bakaus@pbakaus·
Introducing Impeccable 2.0. • data-driven skill rewrite (evals across 7 niches) → better font/color diversity • /critique: subagent de-bias + deterministic anti-pattern detection • visual mode: /critique, CLI, (soon) Chrome • npx impeccable detect (files + URLs) Demo:
English
29
45
536
78.2K
Matt Sims
Matt Sims@mbwsims·
Some valuable research on LLMs, fiction writing, and how to measure creativity: "One of the main contributions of our work is the collection of 14 tests, referred to as the Torrance Test for Creative Writing (TTCW), to evaluate creativity in short fictional stories."
Tuhin Chakrabarty @ ICLR 🇧🇷@TuhinChakr

Can #GPT4 ever write fiction that matches the quality of @NewYorker fiction? Bothered by claims about AI surpassing human creativity🤔? Good news🥁:AI is still 3-10X worse at creativity based on our rubric "Torrance Tests for Creative Writing” #NLProc #HCI arxiv.org/pdf/2309.14556…

English
0
1
11
959
Matt Sims retweetet
Xian Li
Xian Li@xl_nlp·
We were wondering: can we build a good instruction LLaMa W/O relying on large amounts of human annotations or distillation from other models? We found a scalable recipe where the model itself can be put in the loop of generating and curating finetuning data to further self-train
Jason Weston@jaseweston

🚨New Paper 🚨 Self-Alignment with Instruction Backtranslation - New method auto-labels web text with instructions & curates high quality ones for FTing - Our model Humpback 🐋 outperforms LIMA, Claude, Guanaco, davinci-003 & Falcon-Inst arxiv.org/abs/2308.06259 (1/4)🧵

English
2
21
142
60.6K
Matt Sims retweetet
Jerry Wei
Jerry Wei@JerryWeiAI·
New @GoogleAI paper! 📜 Language models repeat a user’s opinion, even when that opinion is wrong. This is more prevalent in instruction-tuned and larger models. Finetuning with simple synthetic-data (github.com/google/sycopha…) reduces this behavior. arxiv.org/abs/2308.03958 1/
Jerry Wei tweet media
English
12
128
599
193.1K
Matt Sims retweetet
Emma Pierson
Emma Pierson@2plus2make5·
New working paper quantifying arXiv publication patterns in the age of LLMs! Joint work with @rajivmovva, @sidhikab1, @kennylpeng, @gsagostini, and @NikhGarg. We analyze LLM citation patterns, fastest growing topics, many other things. Some of our findings: 1/N
GIF
English
6
32
142
45.8K
Matt Sims retweetet
Joelle Pineau
Joelle Pineau@jpineau1·
Llama 2 is out! ai.meta.com/llama This new version has better language generation, more layers of safety, a broad set of partners – and a license that authorizes commercial use. I continue to believe that an open approach is the right path to build better models!
English
9
89
480
85.1K
Matt Sims retweetet
Dreaming Tulpa 🥓👑
Dreaming Tulpa 🥓👑@dreamingtulpa·
Animate-A-Story is a video storytelling approach which can synthesize high-quality, structured, and character driven videos. Composition and scene transitions are still early days, but interesting to see how a first text-to-story pipeline looks like. videocrafter.github.io/Animate-A-Story
English
3
9
54
4.5K
Matt Sims
Matt Sims@mbwsims·
@RoyPrice For sure - if you send me a DM with your email I'll add you to the closed beta.
English
0
0
0
20
Roy Price
Roy Price@RoyPrice·
It will improve but I think people worry too much about AI writing screenplays. Bigger opportunity in analyzing story, helping writers outline and in finding facts. In writing, I see it becoming a tool more than anything else.
English
5
2
23
5K
Matt Sims retweetet
james yu
james yu@jamesjyu·
We released a big update to the @sudowrite Summarize function - it now generates loglines along with a more coherent summary of the plot Here's its take on my short story In the Space of Twelve Minutes uncannymagazine.com/article/in-the… Reply with a story and I'll run it!
james yu tweet media
English
5
3
19
0
Matt Sims
Matt Sims@mbwsims·
@Ted_Underwood Yeah, one of the authors' arguments is that the strict left-to-right generation of autoregressive LMs is a strong limiting factor for complex controllable generation tasks, a shortcoming that the denoising process of diffusion models effectively avoids
English
0
0
1
0