Evalds Urtans

235 posts

Evalds Urtans banner
Evalds Urtans

Evalds Urtans

@YellowRobotXYZ

Founder, AI Researcher, Public Speaker More videos: https://t.co/0co5SU0SrQ

Katılım Şubat 2023
184 Takip Edilen460 Takipçiler
Elviss Strazdiņš 🇱🇻 🇺🇦
Zvanu krāpniecības 3. sērija. Šajā sērijā krāpniece ar mani sazinās WhatsApp, bet es tēloju pilnīgu idiotu, kas viņu nedaudz nokaitina.
Latviešu
4
5
53
3.8K
Evalds Urtans
Evalds Urtans@YellowRobotXYZ·
Fully AI agentic cafe = chaos + over budget 😄 "Mona's inventory missteps have become a running joke among the staff. The agent ordered 6,000 napkins, four first-aid kits, 3,000 rubber gloves, and canned tomatoes that no dish on the menu calls for. The baristas set up a "Hall of Shame" — a shelf visible to customers displaying Mona's most bewildering purchases, including 9 liters of coconut milk and industrial-sized trash bags" perplexity.ai/discover/you/a…
English
0
0
3
170
Evalds Urtans
Evalds Urtans@YellowRobotXYZ·
Sweden’s approach to AI appears impressive—at least until you learn what’s really happening behind the scenes. youtu.be/r-_wS_adkoc
YouTube video
YouTube
Evalds Urtans tweet media
English
0
1
2
76
Evalds Urtans
Evalds Urtans@YellowRobotXYZ·
@karpathy True, how about endless clutter of information inside OpenClaw soul.md identity.md, would’nt ir work much better with iterative and short instructions instead of trying to force unreasonable amount of instructions?
English
0
0
0
58
Andrej Karpathy
Andrej Karpathy@karpathy·
One common issue with personalization in all LLMs is how distracting memory seems to be for the models. A single question from 2 months ago about some topic can keep coming up as some kind of a deep interest of mine with undue mentions in perpetuity. Some kind of trying too hard.
English
1.8K
1.1K
21.2K
2.7M
Jahir Sheikh
Jahir Sheikh@jahirsheikh8·
Best YouTube Channels To Learn AI in 2026 (No BS) 1. Fundamentals – 3Blue1Brown 2. Deep Learning – Andrej Karpathy 3. AI Research – Yannic Kilcher 4. Practical AI – AssemblyAI 5. LLMs – AI Explained 6. ML Theory – StatQuest 7. Papers Simplified – Two Minute Papers 8. GenAI – Matthew Berman 9. AI Agents – Nicholas Renotte 10. Applied ML – Krish Naik 11. PyTorch – Aladdin Persson 12. Math for ML – Serrano Academy 13. Industry Insights – Lex Fridman 14. Real-world AI – DeepLearningAI
Jahir Sheikh tweet mediaJahir Sheikh tweet media
English
36
499
3K
126.8K
Evalds Urtans retweetledi
🎲 Vigants Lesausks
🎲 Vigants Lesausks@vigants·
Cik ātri attīstās AI inteliģence? Es sekoju līdzi Humanity's Last Exam (HLE), kur 1000 zinātnieki no 50 valstīm salikuši kopā 2500 jautājumus, lai novērtētu AI "intelektu". agi.safe.ai Šeit ir grafiks par pēdējiem 2 gadiem. No <10% līdz nu jau 45.9%.
🎲 Vigants Lesausks tweet media
Latviešu
1
1
9
1.7K
Evalds Urtans retweetledi
Derya Unutmaz, MD
Derya Unutmaz, MD@DeryaTR_·
What an amazing story! This is how AI can make a difference! It also shows how terribly slow our current medical research & approval process is at bringing life-saving treatments to patients. With AI we can accelerate all this by orders of magnitude & save so many lives!
Trung Phan@TrungTPhan

Australian tech entrepreneur Paul Conyngham explains how he used ChatGPT/AlphaFold (spent $3,000 with no biology background) to create a custom MRNA vaccine to treat his dog’s cancer tumors. Unreal.

English
20
49
438
72.3K
Evalds Urtans
Evalds Urtans@YellowRobotXYZ·
Recent research from Harvard and Goodfire AI reveals that LLMs don't actually need Chain of Thought reasoning for simple tasks. Although step-by-step thinking technically works, the models are essentially playing a Theatre — they're already capable of producing correct responses more quickly using fewer tokens. However, when it comes to harder problems, there are real breakthrough moments (aha moments) during the reasoning process that genuinely enhance the quality of outputs. Notably, models with more parameters reach these breakthroughs earlier.
Evalds Urtans tweet media
English
0
0
0
136
Evalds Urtans
Evalds Urtans@YellowRobotXYZ·
We are only just entering the AI era. Below is a framework for identifying untapped business niches. Although modern AI models could, in principle, take over much of this work, most opportunities remain unexplored. Sections marked in red show what is already automated, while those in blue highlight what can yet be automated. Report: anthropic.com/research/labor…
Evalds Urtans tweet mediaEvalds Urtans tweet media
English
0
2
14
837
Evalds Urtans
Evalds Urtans@YellowRobotXYZ·
Excited to introduce the asya.ai open-source project AsyaChatUI 🎉🎉🎉 to deploy an LLM chat UX like ChatGPT on your own servers, using Azure Private GPT, Google Gemini API, Anthropic API, etc. This solves multiple major issues for enterprises: 1. Host everything on your servers, with no need for external providers. Control access and user rights. 2. Do not pay per user—just for what you have been using—massive savings. 3. Use your own Microsoft tenant and agreement to support GDPR compliance, eliminating the need to delete sensitive data from prompts and files used in chat. There are many more benefits, deploy, start using & contribute! github.com/asya-ai/asya-c…
Evalds Urtans tweet media
English
0
5
18
1.5K
Evalds Urtans
Evalds Urtans@YellowRobotXYZ·
ChatGPT tip: simply adding the phrase ‘Look at examples in the “Examples” section’ can raise an LLM’s accuracy by up to 44.62%, even when no examples are actually added :D. A new scientific study argues that hallucination in LLMs may not just be a flaw—it could be a still-mysterious feature. The researchers dub this phenomenon ‘Null-Shot Prompting,’ and report that it works especially well on chat-tuned Transformer models tackling math, reading-comprehension tasks, and even hallucination-detection problems. aclanthology.org/2024.emnlp-mai…
Evalds Urtans tweet mediaEvalds Urtans tweet media
English
0
0
4
169
Evalds Urtans
Evalds Urtans@YellowRobotXYZ·
Tip for using RAG systems like NotebookLM, LLamaIndex, and others: The fewer documents you add, the better the semantic search will work, so do not add unnecessary clutter to NotebookLM or other RAG systems. Google DeepMind has shown that for an embedding space with dimension d, the total number of distinct top-k document sets that can be produced for any query is capped by that very dimension. Consequently, extra training, more data, or larger models cannot surpass this boundary—it represents a mathematical limit rather than an engineering challenge. Simply expanding the model’s dimensionality is not a straightforward remedy either. Owing to the curse of dimensionality, as the number of dimensions grows, the angles between vectors converge toward 90°, which effectively erases meaningful differences between documents. arxiv.org/pdf/2508.21038
Evalds Urtans tweet media
English
0
1
1
175
Evalds Urtans
Evalds Urtans@YellowRobotXYZ·
Whenever possible, start a new chat or use the edit option rather than continuing an existing conversation. Studies show that while models respond correctly to single, isolated queries about 90 % of the time, their accuracy sinks to roughly 65 % during extended back-and-forth dialogue. Researchers refer to this drop-off as the “lost in conversation” effect—once an early mistake occurs, the model has difficulty recovering. arxiv.org/pdf/2505.06120
Evalds Urtans tweet media
English
0
2
5
217