schandra

38 posts

schandra

schandra

@schandra

Katılım Ağustos 2007
66 Takip Edilen130 Takipçiler
schandra retweetledi
Alex Prompter
Alex Prompter@alex_prompter·
Meta found that forcing an llm to show its work, step by step, with evidence for every claim, nearly halves its error rate when verifying code patches the technique is embarrassingly simple: a structured template the model has to fill in before it's allowed to say "yes" or "no" no fine-tuning. no new architecture. just a checklist that won't let the model skip steps
Alex Prompter tweet media
English
64
198
2.3K
181.1K
schandra retweetledi
Rohan Paul
Rohan Paul@rohanpaul_ai·
Meta researchers created a mandatory checklist that forces AI to trace code line by line instead of blindly guessing. This structured approach boosted the accuracy of checking real-world code updates to an impressive 93%. Usually, when we ask an AI to check if a software update works, it just looks at the names of the functions and makes a very confident guess. If we want to be absolutely sure the code works, human developers normally have to run the code in expensive and slow testing servers. This paper changes that dynamic entirely by introducing a strict template that forces the AI to write down the exact path the code takes and provide hard evidence for every single claim it makes. Because the AI is forced to slow down and show its work step by step, it catches deeply hidden bugs and proves that patches work with 93% accuracy. The big deal here is that tech companies can now use AI to automatically and reliably verify millions of lines of code without ever paying for the massive computing costs required to actually execute that software. ---- Paper Link – arxiv. org/abs/2603.01896 Paper Title: "Agentic Code Reasoning"
Rohan Paul tweet media
English
34
89
768
58.5K
schandra retweetledi
Kensen Shi
Kensen Shi@kensen_shi·
🔔 Announcing our paper on Natural Language Outlines for Code! Our vision 🔮 - NL Outlines empower human developers with new forms of AI assistance throughout the software development process 🚀 Paper: arxiv.org/abs/2408.04820 FSE'25 presentation: youtube.com/watch?v=54v7Zp… 🧵👇
YouTube video
YouTube
Kensen Shi tweet mediaKensen Shi tweet media
English
1
8
23
3.1K
schandra retweetledi
David Lo
David Lo@davidlo2015·
@schandra is giving the 5th keynote (second industry talk) of @ConfForge on "AI for Software Engineering at Google: Progress and Path Ahead" :) Packed room with many standing to hear Satish experience at @Google :) If you are at @ICSEconf, pls join us :)
David Lo tweet media
English
0
2
5
116
schandra retweetledi
PLSE@NUS
PLSE@NUS@nus_plse·
Exciting News! The ICSE 2013 paper "SemFix: Program Repair via Semantic Analysis" that started our journey in program repair is recognized by the Most Influential Paper Award ten years later in 2023. Congrats to @AbhikRoychoudh1 and all co-authors! abhikrc.com/pdf/ICSE13-SEM…
PLSE@NUS tweet media
English
6
7
98
9.7K
schandra retweetledi
FSE 2026
FSE 2026@FSEconf·
Happy New Year everyone! We are looking forward to your contributions to ESEC/FSE 2023!! We will be posting updates and introducing our PC over the next few months here. Reminder that the research track paper submissions are due on February 2nd! buff.ly/3X7lP1s
English
0
10
21
3.3K
schandra retweetledi
@romeu@mastodon.social
@[email protected]@malk_zameth·
#curryon Facebook created a product that analyse their codebases and fixes people did and code reviews pull requests proposing similar fixes to be added seems very nifty
@romeu@mastodon.social tweet media
English
1
3
7
0
schandra retweetledi
Engineering at Meta
Engineering at Meta@Meta_Engineers·
We have built a new system that leverages machine learning to more efficiently detect potential regressions in a proposed code change. This predictive test selection method has doubled the efficiency of Facebook's continuous integration system. code.fb.com/developer-tool…
GIF
English
1
43
92
0
schandra retweetledi
Engineering at Meta
Engineering at Meta@Meta_Engineers·
Facebook has built a tool called Getafix that automatically finds fixes for code bugs and offers the patch to engineers to approve. Here's how it works. code.fb.com/developer-tool…
GIF
English
15
338
785
0
schandra
schandra@schandra·
@swarat It was great to have you here. Looking forward to continued collaboration with you.
English
0
0
1
0
Swarat Chaudhuri
Swarat Chaudhuri@swarat·
Finished a fun two months of work at Facebook HQ on statistical program synthesis. Very impressed by the energy in the Big Code team; some quite promising results. More on these in a few months, hopefully!
English
1
1
20
0
schandra retweetledi
Arie van Deursen
Arie van Deursen@avandeursen·
We just sent out the notifications for ESEC/@FSEconf 2017. Out of 295 submissions, the PC accepted 72 in total (some conditionally).
English
0
8
20
0
schandra retweetledi
Flow
Flow@flowtype·
Soon @flowtype will complain when you call a function with too many args. Here's a blog post explaining the change: flow.org/blog/2017/05/0…
English
3
44
104
0
schandra retweetledi
Sylvia Grewe
Sylvia Grewe@sylviagrewe·
Youngest speaker at #scala16: Kartik Chandra (still in High School) on "Automatically finding Scala Soundness Bugs"
English
0
8
16
0
schandra retweetledi
Sandro Stucki
Sandro Stucki@stuckintheory·
Kartik Chandra showing us how to automatically find soundness bugs in typecheckers at #scala16 @splashcon
Sandro Stucki tweet media
English
0
3
4
0
schandra retweetledi
Sarah Eli Judd (they)
Sarah Eli Judd (they)@SarahEJudd·
"The idea of breaking something down is something you learn in Scratch subliminally" -Kartik Chandra #ScratchMIT2016
English
0
2
5
0