asfaan murthyulas

279 posts

@AMurthyulas

Ask stupid questions. Think the impossible. Do it step by step.

Joined January 2023
392 Following · 2 Followers
asfaan murthyulas
asfaan murthyulas@AMurthyulas·
@tunguz Use all thinking patterns that created aha moments in great people who solved impossible problems.
English
0
0
0
38
Bojan Tunguz
Bojan Tunguz@tunguz·
"Solve Riemann Hypothesis. Make no mistakes."
English
14
2
87
3.2K
World of Statistics
World of Statistics@stats_feed·
The inventor of Vaseline was so convinced of its healing powers that he ate a spoonful every day and had himself covered in it when he got sick. He lived to 96.
English
211
354
4.5K
328.4K
Andrew White 🐦‍⬛
Andrew White 🐦‍⬛@andrewwhite01·
In the history of the atom, there was a short time when elliptical electron orbits were thought to be the best model (Bohr-Sommerfeld). Elliptical orbits look awesome, so everyone kept them around and they've become the default visual (like the emoji ⚛️). Totally wrong model, though.
English
5
5
47
2.9K
allen institute
allen institute@AllenInstitute·
Neurons don't connect randomly. In this video by our #ElectronMicroscopy team, a blue neuron's axon forms a connection to a far neuron. Along the way, it links to some neighbors while skipping thousands. Connectomics seeks to understand what makes those connections special.
English
4
50
223
12.6K
YiFan Gao
YiFan Gao@_YifanGao·
New paper! 🧠 We showed that the Reward Positivity backpropagates from feedback to predictive cues during reinforcement learning, the first time this has been demonstrated with noninvasive EEG in humans! Huge thanks to my mentor and my co-authors! Open access: doi.org/10.1111/psyp.7…
English
0
8
40
2.4K
asfaan murthyulas
asfaan murthyulas@AMurthyulas·
@SiniiMayo Especially religion and political parties.
Malayalam
0
0
0
3
Sini
Sini@SiniiMayo·
If someone comes and criticizes a person we dislike, we happily sit and listen. But if they are talking about someone dear to us, we get angry. Isn't this basic psychology?
Malayalam
8
2
28
1.1K
asfaan murthyulas
asfaan murthyulas@AMurthyulas·
@sanjaykumarpv The elephant must have seen her, got scared, and run.
Malayalam
0
0
0
5
Sanjay
Sanjay@sanjaykumarpv·
On Khatima Road in Uttarakhand, a Honda Activa ridden by a young woman, a teacher, hit an elephant that was crossing the road. The force of the impact broke the elephant's tusk. What I don't understand is how people who can't even properly see an elephant walking along the road manage to see the humans on it.
Malayalam
1
0
6
534
David | Cybersecurity
David | Cybersecurity@PrinceDavies55·
@jon_d_doe "Meta laid off 8,000 and shifted 7,000 into AI. You know what that means for cybersecurity? More AI making decisions. Fewer humans catching the mistakes. More attack surface. More risk. Upskill or get left behind."
English
1
0
0
168
Àgbà John Doe
Àgbà John Doe@jon_d_doe·
Meta is laying off 8,000 employees (10% of its global workforce). And re-assigning 7,000 to focus on AI development. In the next few years, it's likely to lay off 50% of its employees. Omo, there is a big threat to a lot of jobs in the future. Big one. End.
English
143
213
1.4K
21.7K
송준 Jun Song
송준 Jun Song@jun_song·
Frequent question : Are you going to work for a big tech? No. I just left corp for open source. I've received offers from multiple companies, including a frontier lab, but I turned them all down. I will stay independent and dedicate myself to Sovereign AI open source. 💪
English
28
0
162
6.7K
Open Source Intel
Open Source Intel@Osint613·
Trump: The Iran conflict will end soon.
English
33
31
411
30.6K
Rand
Rand@rand_longevity·
what is the first question you will ask the AGI?
English
305
14
222
18.2K
asfaan murthyulas
asfaan murthyulas@AMurthyulas·
@JaibyGeorge5979 The fertilizer will arrive once the war is over, but what was said cannot be taken back.
Malayalam
0
0
1
113
Jaiby George
Jaiby George@JaibyGeorge5979·
What a pity... what a sorry state for India 😡😡😡
Malayalam
18
17
196
8.8K
Autism Capital 🧩
Autism Capital 🧩@AutismCapital·
You thought YOUR social anxiety was bad? 😂 💀
English
557
446
5.3K
702.3K
asfaan murthyulas
asfaan murthyulas@AMurthyulas·
@manoramanews America is a luxurious country that they are proud of. They will certainly find a way to stay.
English
0
0
0
113
Manorama News
Manorama News@manoramanews·
'Find a new job within 60 days, or leave the US'! Hearts pounding, Indian tech... Read more at: manoramanews.com/gulf-and-globa… #us #india
Malayalam
3
1
8
1.2K
Avi Chawla
Avi Chawla@_avichawla·
Karpathy's prediction about RL is coming true now!

He called reward functions unreliable and argued that a single reward number is too low-dimensional to teach an agent what "good" means for complex tasks. To solve this, Agents need a knowledge-guided review as a higher-dimensional feedback channel.

Every major AI lab trains models with RL today (OpenAI, Anthropic, DeepSeek). And their key bottleneck has always been the reward functions. GRPO by DeepSeek worked well for math and code because the environment gave a binary signal. But for real agent tasks, someone still has to hand-code the scoring function. That takes days and breaks every time the pipeline changes.

RULER (implemented in OpenPipe ART, 10k stars) addresses the exact problem Karpathy identified. The reward criteria are defined in plain English, and an LLM evaluates each trajectory against that description to provide feedback for training.

I trained a Qwen3 1.4B agent that plays 2048 using GRPO with this exact workflow. In this case, the agent saw the board, picked a direction, and RULER evaluated the outcome, all from this natural language definition. You can see the full implementation on GitHub and try it yourself.

Here's the ART Repo: github.com/OpenPipe/ART (don't forget to star it ⭐ )

Just like RLHF replaced manual rankings and GRPO replaced the critic model, natural language rewards are replacing hand-coded scoring functions. RL reward engineering is now prompt engineering.

I wrote a full walkthrough covering RL for LLM agents, from RLHF to GRPO to RULER, in the article below.
Avi Chawla@_avichawla

x.com/i/article/2048…

English
52
124
1.1K
242.3K
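The post above says GRPO replaced the critic model and that a judge can score each trajectory. A minimal sketch of the group-relative step is shown here, assuming the commonly described formulation in which each trajectory's score is normalized against the other trajectories sampled for the same prompt. `toy_judge` is a hypothetical stand-in for an LLM judge; it is not RULER and not part of the OpenPipe ART API.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize rewards within one group of trajectories for the same prompt.

    No learned critic is needed: the group mean acts as the baseline.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


def toy_judge(trajectory):
    """Hypothetical judge: scores a 2048 trajectory by its highest tile.

    A real setup would ask an LLM to grade the trajectory against a
    plain-English rubric; this stub only keeps the example runnable.
    """
    return max(trajectory["tiles"]) / 2048.0


# Four made-up trajectories sampled for the same starting board.
group = [
    {"tiles": [2, 4, 64]},
    {"tiles": [2, 8, 256]},
    {"tiles": [4, 16, 128]},
    {"tiles": [2, 2, 32]},
]

scores = [toy_judge(t) for t in group]
advantages = group_relative_advantages(scores)

for t, a in zip(group, advantages):
    print(max(t["tiles"]), round(a, 3))
```

Trajectories that score above the group average get a positive advantage and are reinforced; those below it get a negative one. If every trajectory in a group receives the same score, all advantages are zero and the group contributes no learning signal.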
asfaan murthyulas
asfaan murthyulas@AMurthyulas·
@dwarkesh_sp @ericjang11 I think scaling can solve this problem. Leela Zero uses a transformer architecture. CNNs and RNNs have a proximity bias, but a scaled transformer can reach CNN- and RNN-level performance. Such biases can save a lot of compute and data (at the cost of reasoning beyond the bias).
English
0
0
0
103
Dwarkesh Patel
Dwarkesh Patel@dwarkesh_sp·
.@ericjang11 tried using transformers for his Go bot, but they couldn't beat ResNets. The reason gets at something general about architectures. ResNets are biased towards the local. Nearby things matter more, and a useful pattern in one place is a useful pattern anywhere. Transformers are biased the other way, towards global context, with every position able to attend to every other. Most Go fighting is local, and a useful local pattern learned in one position can be applied anywhere on the board. A ResNet's inductive bias means it gets these insights about Go for free. But a transformer has to pay for them.
English
12
13
249
28.4K
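The claim in the post above that "a useful pattern in one place is a useful pattern anywhere" can be illustrated with a toy 3x3 convolution. This is a sketch under stated assumptions, not a Go engine: the board size, the stone pattern, and the helper names (`conv3x3`, `place`, `argmax`) are all made up for illustration. One shared kernel gives the same response to the same local shape wherever it sits on the board.

```python
def conv3x3(board, kernel):
    """Apply one shared 3x3 kernel at every board position (zero padding)."""
    n = len(board)
    out = [[0.0] * n for _ in range(n)]
    for r in range(n):
        for c in range(n):
            total = 0.0
            for dr in (-1, 0, 1):
                for dc in (-1, 0, 1):
                    rr, cc = r + dr, c + dc
                    if 0 <= rr < n and 0 <= cc < n:
                        total += kernel[dr + 1][dc + 1] * board[rr][cc]
            out[r][c] = total
    return out


def place(n, pattern, top, left):
    """Return an empty n x n board with `pattern` stamped at (top, left)."""
    board = [[0] * n for _ in range(n)]
    for r, row in enumerate(pattern):
        for c, v in enumerate(row):
            board[top + r][left + c] = v
    return board


def argmax(grid):
    """Coordinates of the largest value in a 2D grid."""
    cells = ((r, c) for r in range(len(grid)) for c in range(len(grid)))
    return max(cells, key=lambda rc: grid[rc[0]][rc[1]])


# A made-up local shape: four stones around an empty point.
pattern = [[0, 1, 0],
           [1, 0, 1],
           [0, 1, 0]]
kernel = pattern  # a detector tuned to exactly this shape

out_a = conv3x3(place(9, pattern, 1, 1), kernel)  # shape centered at (2, 2)
out_b = conv3x3(place(9, pattern, 5, 4), kernel)  # shape centered at (6, 5)

print(argmax(out_a), out_a[2][2])
print(argmax(out_b), out_b[6][5])
```

The nine kernel weights are reused at every position, so the detector fires equally strongly at both locations without ever having seen the second one. An attention layer has no such sharing built in and would have to learn that equivalence from data, which is the cost the post refers to.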