Chao Chen

1.7K posts

Chao Chen

Chao Chen

@CrazyJvm

CEO of AI startup

Katılım Mart 2012
817 Takip Edilen232 Takipçiler
Chao Chen retweetledi
LangChain
LangChain@LangChain·
📚 🔍 bRAG: Complete RAG Guide A comprehensive project showcasing RAG implementations with LangChain - from basics to advanced features like multi-query retrieval, ColBERT indexing, and RAG-Fusion. Check out this 1.7K+ starred guide 🚀 github.com/bRAGAI/bRAG-la…
LangChain tweet media
English
8
236
1.3K
158.8K
Chao Chen retweetledi
Yangyi
Yangyi@yangyi·
最近大家说的热火朝天的MCP,到底有什么神奇功效? 一起看看来自AI Jason的这期讲解,LLM+MCP直接起飞
中文
42
163
799
101K
Chao Chen retweetledi
GitHubDaily
GitHubDaily@GitHub_Daily·
微软出了一门给初学者学习的 AI 智能体课程:AI Agents for Beginners。 共 10 节课程,涵盖构建 AI 智能体的所有基础知识,旨在教授我们从零开始构建一个AI智能体。 GitHub:github.com/microsoft/ai-a… 课程内容已做了中文翻译,学习起来更加轻松,同时提供每节课所使用的示例代码,方便我们运行。
GitHubDaily tweet media
中文
5
277
839
99.1K
Chao Chen retweetledi
Shubham Saboo
Shubham Saboo@Saboo_Shubham_·
I built a Deepseek R1 RAG Reasoning Agent running locally on my computer. It's an Agentic RAG reasoning agent that can think, reason and fall back to web search if needed. 100% Opensource code with step-by-step tutorial.
English
35
164
1.1K
124.5K
Chao Chen retweetledi
WY
WY@wangyuanzju·
大模型要回答好问题,最重要的是要先认真学习 很多优化方法都可以归结这一点,如: - Anthropic提出的Context Retrieval:anthropic.com/news/contextua… - Jina团队提出的Late Chunking:jina.ai/news/late-chun… - RAGFix团队提出的Fully-Formatted Facts:@JamesStakelum/the-end-of-ai-hallucinations-a-breakthrough-in-accuracy-for-data-engineers-e67be5cc742a" target="_blank" rel="nofollow noopener">medium.com/@JamesStakelum… 这些方法做的都是通过上下文彻底理解文中每一段、每句话、每个词。 下面这篇论文提出的先对文档造QA对,回答时召回问题和答案而不是召回原始文档chunk,原理也是类似:arxiv.org/pdf/2408.09017 到目前为止,我们看到的这些方法都还是基于当个文档的。但我们知道很多时候单看一篇文章是看不懂的,还需要主题学习。 可以预期,下一阶段的研究会发展到怎么让AI进行有效的主题学习。
中文
4
41
124
13.6K
Chao Chen retweetledi
Swapna Kumar Panda
Swapna Kumar Panda@swapnakpanda·
9 FREE Books from MIT for Absolute Beginners - Artificial Intelligence (AI) - Machine Learning (ML) - Deep Learning (DL) - Reinforcement Learning (RL)
Swapna Kumar Panda tweet mediaSwapna Kumar Panda tweet mediaSwapna Kumar Panda tweet mediaSwapna Kumar Panda tweet media
English
98
593
3.6K
311.6K
Chao Chen retweetledi
Tw93
Tw93@HiTw93·
浙江大学出的这个开源的书籍「大模型基础」值得一看,行文风格挺不错的,易读、严谨、有深度的大模型教材。 github.com/ZJU-LLMs/Found…
Tw93 tweet media
中文
28
709
2.6K
353.1K
Chao Chen retweetledi
Avi Chawla
Avi Chawla@_avichawla·
KV caching in LLMs, clearly explained (with visuals):
English
21
309
2.6K
521.8K
Chao Chen retweetledi
Yung-Sung Chuang
Yung-Sung Chuang@YungSungChuang·
(1/5)🚨LLMs can now self-improve to generate better citations✅ 📝We design automatic rewards to assess citation quality 🤖Enable BoN/SimPO w/o external supervision 📈Perform close to “Claude Citations” API w/ only 8B model 📄arxiv.org/abs/2502.09604 🧑‍💻github.com/voidism/SelfCi…
Yung-Sung Chuang tweet mediaYung-Sung Chuang tweet media
English
12
73
314
39.4K
Chao Chen retweetledi
Olivert
Olivert@indiehackercase·
这份文件真是清杀疯了!清华大学104页《DeepSeek:从入门到精通》。短短四五天,就有50万播放的视频了。资料链接:pan.quark.cn/s/4691006cb600
Olivert tweet media
中文
1
122
433
45.2K
Chao Chen retweetledi
Rohan Paul
Rohan Paul@rohanpaul_ai·
Github 🤖: Open-source GenBI AI Agent that empowers data-driven teams to chat with their data to generate Text-to-SQL, charts, spreadsheets, reports, and BI. 📊 Helps you chat with data to generate SQL, charts, and reports, using your choice of LLM. It provides an open-source GenBI solution for data-driven teams seeking insights without code. What it offers: → Wren AI is an open-source GenBI AI Agent that enables data-driven teams to interact with their data through chat. → It generates Text-to-SQL queries, charts, spreadsheets, reports, and BI insights. → It supports multiple LLMs including OpenAI, Azure OpenAI, DeepSeek, Google Gemini, Vertex AI, Bedrock, Anthropic, Groq, Ollama, and Databricks. → Wren AI allows users to ask data questions in multiple languages and provides AI-generated summaries and visualizations of query results. → It features AI-powered data exploration, semantic indexing for context, and allows exporting data to Excel and Google Sheets.
Rohan Paul tweet media
English
7
128
672
57.2K
Chao Chen retweetledi
歸藏(guizang.ai)
歸藏(guizang.ai)@op7418·
新发布的最强开源语音模型 Zonos 语音生成质量非常高,而且这次有中文 - 两种1.6B 模型,transformer 和 SSM - 用5到30秒的语音进行高保真语音克隆 - 可以调节速度,音高,音频质量和情绪 - 添加文本和音频前缀,实现更丰富的说话人匹配效果 -在 RTX 4090 显卡上运行时,实时率约为 2 倍
中文
39
218
859
93.2K
Chao Chen retweetledi
Rishabh Agarwal
Rishabh Agarwal@agarwl_·
I recently gave a tutorial on knowledge distillation for LLMs, explaining the mathematical derivations behind the commonly used methods. Sharing the slides here given the recent interest in this topic. drive.google.com/file/d/1xMohjQ…
Rishabh Agarwal tweet media
English
22
200
1.3K
196.9K
Chao Chen retweetledi
ℏεsam
ℏεsam@Hesamation·
DeepSeek by hand is just another level, @ProfTomYeh made a lecture on it: - Multi-Head Attention - Multi-Head Latent Attention - Single Expert - Mixture of Experts - Sparse Mixture of Experts - Shared+Routed Mixture of Experts - RoPE
English
11
203
1.4K
72.6K
Chao Chen retweetledi
munen
munen@munen5647·
Nobody can explain Transformers and Self-Attention like professor Bryce... He's one of the Hidden gems of YouTube, his all videos are packed with knowledge and he teach with enthusiasm and dedication. Below 👇🏻 link in comments.
munen tweet media
English
11
582
5K
2.3M