micro1

1.1K posts

micro1 banner
micro1

micro1

@micro1_ai

The AI platform for human intelligence

San Francisco, CA Katılım Ağustos 2022
0 Takip Edilen9.3K Takipçiler
micro1
micro1@micro1_ai·
In recognition of National Cancer Prevention and Early Detection Month, join us for an important conversation on how AI is reshaping the future of cancer care. From accelerating drug discovery to enabling more accurate, scalable diagnostics, artificial intelligence is unlocking new possibilities across prevention, early detection, and treatment. We’ll also dive into the real challenges, data quality, bias, interpretability, and bridging the gap between research breakthroughs and real-world clinical impact. Featuring: •Virginie Buggia-Prevot, PhD (Executive Director, @ValoHealth) •Bahar Rahsepar, PhD (Associate Director of Product, @Path_AI) •Paola Rodríguez - MD, Eng, MSc. (Director of Medical Research, @micro1_ai) Moderated by @Exp_Mark (Chief Economist, micro1) This session brings together leading voices at the intersection of AI and healthcare to explore how human + AI are transforming patient outcomes. Join us on 4/28, 10am PT: us06web.zoom.us/webinar/regist…
micro1 tweet media
English
3
6
16
794
micro1
micro1@micro1_ai·
Dan Heffernan has led sales teams at some of the biggest names in tech and now is making his mark in AI. In this conversation, he breaks down why the human element is the secret behind the best models, and how he's putting that belief into action at micro1 while training AI. Watch the full interview on YouTube now! (Link in the comments)
English
7
6
23
1.7K
micro1
micro1@micro1_ai·
micro1 x Crosby: AI Fellowship for SaaS Contracting Attorneys We've teamed up with @crosbylegal to launch an AI Fellowship for SaaS Contracting Attorneys, and we're looking for attorneys with deep expertise in tech transactions to help us shape how AI handles real legal work. Here's what the fellowship looks like: - Simulated contract negotiations and redlining exercises - Evaluating AI-generated suggestions for accuracy and legal soundness - Collaborating with product and research teams to improve AI outputs This is a part-time, fully remote opportunity paying $80-$105/hr. Apply now at the link in the comments.
micro1 tweet media
English
7
14
56
5.2K
micro1
micro1@micro1_ai·
This Tuesday at 11:00 AM PT, micro1 is hosting a conversation on The Human Foundation of AI in Healthcare on the micro1 Forum. Moderated by @Exp_Mark (Chief Economist at micro1) this session brings together Paola Rodríguez - MD, Eng, MSc. (Director of Medical Research, micro1), Sam Hashemi (VP at @prenuvo), and David Q. Sun (VP of AI/ML at @eightsleep) to explore how human intelligence shapes the future of healthcare AI. As AI systems evolve from static tools to more agentic, decision-supporting systems, one thing is clear: the future of healthcare won’t be defined by automation alone, but by how effectively humans and machines work together. This session is based on their recent co-authored research paper: micro1.ai/research/the-h… Register for the live event to hear from the authors themselves: micro1.ai/forum/the-huma…
micro1 tweet media
English
2
4
16
1.4K
micro1
micro1@micro1_ai·
Human-first AI ❤️ Last Friday we hosted an after office in Buenos Aires with 100+ experts from the micro1 community. A great chance to step away from the screen, connect in person, and spend time with the incredible people contributing to AI training projects across our platform. Thanks to everyone who joined and made it such a great evening!
micro1 tweet mediamicro1 tweet media
English
4
6
34
1.4K
micro1
micro1@micro1_ai·
Tune in with @AndrewLeeMaas & @Box 👇
Box@Box

Most enterprises think non-deterministic AI outputs mean they can't trust agent workflows. Andrew Maas, VP of AI at @micro1_ai, disagrees and explains exactly how to engineer reliability into agentic systems on the latest Partner Podcast with our CTO @BenAtBox. Timestamps 02:54 What micro1 does and the role of human experts in AI systems 04:13 Rise of multi-step agentic workflows and domain-specific AI capabilities 07:48 Limits of current models and the need for deeper domain expertise 08:12 One-shot vs multi-step AI reasoning and why it matters 10:07 Composing multiple LLM steps to create reliable enterprise workflows 13:22 Variability in LLM outputs and concerns about enterprise reliability 18:54 Files as the new interface between humans and AI agents 22:24 Using evals and human review to improve AI systems in production 26:30 Experiment and challenge assumptions about AI limits

English
0
1
10
1.3K
micro1
micro1@micro1_ai·
This Friday at 9:00 AM PT, Chief Economist at micro1, @Exp_Mark will be joined by Victoria (Tori) Westerhoff (Principal AI Security & AI Red Team at @Microsoft) and Liu Zhang, Member of Technical Staff at micro1) on the micro1 forum to explore red teaming for agentic AI systems. We’ll dive into how agentic systems fail in practice, from prompt injection and tool misuse to complex multi-step breakdowns, and how leading teams are advancing red teaming with continuous testing, expert evaluation, and large-scale adversarial simulations. Register here: us06web.zoom.us/webinar/regist…
micro1 tweet media
English
1
2
13
1K
micro1
micro1@micro1_ai·
Every breakthrough in healthcare AI is built on a foundation of human expertise. We collaborated with our friends at @eightsleep and @prenuvo on a write-up exploring where medical AI is heading. The article covers three angles: 1) How human expertise shapes reliable clinical AI 2) What continuous biosignal data from sleep can tell us about long-term health 3) How imaging is evolving from a one-time diagnostic into a longitudinal health map Full article linked in the comments.
micro1 tweet media
English
7
11
26
1.9K
micro1
micro1@micro1_ai·
The micro1 referral program has now surpassed 1,000,000 referrals 🚀 We're hiring experts in medical, legal, finance, STEM, coding, and more to help train AI models. Know someone who might be a good fit? Send them our way and earn $100–$3,000 per successful hire. Link to register in the comments.
English
12
17
113
16K
micro1
micro1@micro1_ai·
Prospera: the new standard for evaluating tax reasoning in AI
Ali Ansari@aliansarinik

Introducing Prospera: a benchmark that tests AI agents on real federal tax returns, designed by our research team in collaboration with CPAs and industry-leading tax professionals. A complete federal return requires dozens of source documents, hundreds of interdependent calculations, and no room for errors. We evaluated GPT-5.4, Claude Opus 4.6, and Gemini 3.1 Pro with no hints on which forms to file, scored against 20+ expert-authored criteria per return. Here’s the Results (Pass@3): -GPT-5.4: 28% -Gemini 3.1 Pro: 18% -Claude Opus 4.6: 16% To put those numbers in context, the tasks in Prospera weren't obscure edge cases. Filing a federal tax return is something millions of Americans do every year, yet 44% of evaluation criteria failed across all models. Full report linked in the comments.

English
1
1
11
1.3K
micro1 retweetledi
Descope
Descope@descopeinc·
Agentic AI adoption is moving faster than most security and compliance teams can keep up with. We collaborated with @micro1_ai to cover the processes and technologies organizations need to achieve holistic visibility over their agentic flows–from inception to deployment. 🧵👇
Descope tweet media
English
2
3
10
586
micro1
micro1@micro1_ai·
The best place to train AI models. Apply today.
English
12
26
157
12.8K
micro1
micro1@micro1_ai·
On Tuesday at 12:30 PM PT, Chief Economist at micro1, @Exp_Mark, will be joined by Khyati Jain (Software Engineer at @GoogleDeepMind) and Dr Xin (Skye) Zhao (@Microsoft AIEI Fellow) on the micro1 forum to discuss the future of AI in education. We’ll cover how AI is used in classrooms today, how to design systems that support teachers, and how to measure real learning outcomes at scale. Register here: micro1.ai/forum/the-futu…
micro1 tweet media
English
3
5
23
2.1K
micro1
micro1@micro1_ai·
micro1 Cortex: the human intelligence layer for high-performing AI agents. Most agents look good in demos, but break in production. - Benchmarks don’t reflect real workflows - Internal testing misses edge cases - Failures are hard to diagnose - Performance degrades as you scale Cortex leverages expert human judgement to evaluate, train, and continuously refine agents so they deliver exceptional performance in real-world workflows and drive measurable outcomes in production.
English
9
13
40
3.4K
kane🐐🔺️
kane🐐🔺️@kane_120·
I just got paid for my micro1 job🎉. This is the latest payment ever because it was stated that payment will come in on the 20th but today is 19th and payment is already dropping. Payment is done twice in a month for per completed hr of work.
kane🐐🔺️ tweet media
English
25
1
58
7.2K