Yuchang Wu

79 posts

Yuchang Wu banner
Yuchang Wu

Yuchang Wu

@yy_stat

Biostatistician @ University of Wisconsin-Madison. Scientist @ Qiongshi Lu's group. 子曰:“学而不思则罔,思而不学则殆。”

Wisconsin, USA Katılım Nisan 2019
314 Takip Edilen173 Takipçiler
Yuchang Wu
Yuchang Wu@yy_stat·
@AlexTISYoung Claude is really good at software engineering. Tried both paid version of Gemini (edu version) and ChatGPT. I think Gemini is better for scientific research. But it needs very precise prompting, so I usually first ask it to optimize my prompt, then answer its own prompt.
English
0
0
1
46
Alex Strudwick Young
Alex Strudwick Young@AlexTISYoung·
Surprised Gemini is not getting much love given recent benchmark results. I wonder why?
English
1
0
1
396
Alex Strudwick Young
Alex Strudwick Young@AlexTISYoung·
Which AI pro subscription should I get for advanced technical/scientific research?
English
2
1
6
1.1K
Yuchang Wu
Yuchang Wu@yy_stat·
@SashaGusevPosts edu email account can get an edu version of Gemini, which should have less hallucinations.
English
0
0
1
18
Sasha Gusev
Sasha Gusev@SashaGusevPosts·
There are still some pitfalls. It hallucinates in ways that I could see being a major time sink. I recently asked it to simulate stabilizing selection but it coded up directional selection in a way that would not have been obvious without already knowing the right math.
English
2
0
14
1.7K
Sasha Gusev
Sasha Gusev@SashaGusevPosts·
I've gradually started using Gemini as a kind of online notepad. By taking notes directly in the prompt it is then trivial to ask the AI to identify any immediate problems, do a lit review, implement a toy simulation, outline a talk, suggest ways to visualize the results, etc.
English
4
4
69
7.9K
Jiacheng Miao
Jiacheng Miao@Jiacheng_Miao·
🚀 Introducing Paper2Agent - a system that turns static research papers into interactive AI agents you can chat with and use the tools/data in the paper. Just ask, “Run this paper’s method on my data.” The paper agent will handle the task for you. 1/n
Jiacheng Miao tweet media
English
18
92
512
39.3K
Yuchang Wu
Yuchang Wu@yy_stat·
@dougthespeed We first copy each chromosome in plink format to active environment, extract HM3 SNPs, and then merge all the HM3 SNPs from each chromosome. The copying process is slow (~1h for one larger chr?), but doable. You can run the job in background. Hail is always painful to me.
English
1
0
2
104
Doug Speed
Doug Speed@dougthespeed·
StatGen Question - Help with AllofUs data. We would like to extract genotypes for all individuals and ~1M HapMap3 SNPs in PLINK (1) format (which we can then use for future analyses). Has anyone done this and can offer advice?
English
5
0
8
1.3K
Yuchang Wu
Yuchang Wu@yy_stat·
@AlexTISYoung sorry to hear this. Hope you'll have a speedy and full recovery!
English
0
0
1
72
Alex Strudwick Young
Alex Strudwick Young@AlexTISYoung·
In March last year, at the age of 35, I was diagnosed with advanced stage III rectal cancer with a metastasis in my liver. This was a shock: I had no family history, and none of the doctors suspected it. In fact, I'd had a negative occult blood test when I went to ER with severe digestive symptoms a few months before. The first lesson I learnt from this was that there's no substitute for the proper diagnostic procedures: if you have any bleeding, get a colonoscopy, even if you don't have family history. I was treated with total neoadjuvant therapy at UCLA. I had 6 weeks of combination radiation and chemotherapy (Xeloda). The photo shows me after completing that first step of my treatment along with my parents in Yosemite. My mother has already survived two breast cancers, and my father had recently had a knee replacement (at the age of 70), so getting all three of us to the top of Sentinel Dome felt like an achievement. The chemoradiation was followed by 6 rounds of combination chemotherapy: oxaliplatin infusions followed by two weeks of Xeloda. I tolerated this unusually well. During this period, I managed to hike up to over 11,000ft multiple times in the Sierras, and I wrote the main text of a 72 author meta-analysis paper and submitted it to Nature where it has passed first round reviews (currently working on revisions). I showed a good clinical response, with the tumor in my rectum nearly completely disappearing and the liver metastasis shrinking substantially. At the end of October I had surgery: I was under for 9 hours, and they cut out a large bit of my rectum, 1/3rd of my liver, and gave me a temporary ileostomy. Waking up from this was probably the strangest experience of my life: I felt more machine than man, and I had wild hallucinations form such a large dose of anesthesia. It took a while to feel OK after that surgery. However, I still had the ostomy. I had that reversed at the end of January in another surgery, which took a greater toll on me than I expected. There's a cumulative effect of having so many major medical interventions, and it reveals anything weak in body or mind. Surprisingly, my mind held up well during this process. This was thanks to the love and support I received from friends and family and the great care I received from UCLA — in particular Dr Anand (medical oncology), Dr Kazanjian (colorectal surgery), Dr Agopian (liver surgery), and the radiation oncology team. I only just started to feel like I was starting to feel OK again and I got bad news: I got a positive circulating tumor DNA test (ctDNA), the signatera test from natera. Unfortunately, the second test I had done recently showed the level of ctDNA in my blood is increasing. This means disease recurrence is almost certain, likely within a year from the first positive test. Since then I've been put on celecoxib based on recent data indicating this can reduce risk of disease recurrence, although it doesn't seem to agree to well with my digestion. I've had CT, MRI, colonoscopy and there's been nothing to see, but it's only a matter of time. My oncologist thinks it is very likely (90%+) that it will be a local recurrence in my liver, which should be fairly easy to cure. But there's a small chance it is something worse, even potentially incurable. This is the reality of cancer for many patients: years of uncertainty. It's still not clear why this happened. Genetic testing returned nothing. My polygenic risk score (PRS) — something close to my own research — gave me totally average risk, at least according to 23andMe. I've always been slim, fit, eaten a pretty healthy diet. I was even raised vegetarian by hippy parents. I probably drank and partied more than is medically advised, but nothing extreme. However, there's been a well-documented uptick in cases like mine. A recent paper indicated this may be due to colobactin, a bacterial mutagen associated with E. coli among other bacteria. I may look at my Tempus tumor data to see if the somatic mutation in the APC gene they found (the only driver mutation) has a signature matching colobactin. If anyone knows anyone with a worthwhile expert opinion on how to manage my situation I'd be interested to hear!
Alex Strudwick Young tweet media
English
72
75
678
143.2K
Yuchang Wu
Yuchang Wu@yy_stat·
@SashaGusevPosts good question. I don't know since I'm not closely following on this topic.
English
0
0
1
98
Sasha Gusev
Sasha Gusev@SashaGusevPosts·
@yy_stat Thanks, I hadn't seen this before, great commentary. Has anyone done the analysis of real couples since then?
English
1
0
4
287
Sasha Gusev
Sasha Gusev@SashaGusevPosts·
The New York Times recently had an article on IVF and embryo selection which I think buries a few important ledes about these products ...
Sasha Gusev tweet media
English
11
42
286
38.6K
Yuchang Wu
Yuchang Wu@yy_stat·
update: The filter function on data browser does not give accurate results: some categories and some variants meet the criteria are missing. Guess we have to use VAT for more serious work.
English
0
0
1
292
Yuchang Wu
Yuchang Wu@yy_stat·
Wow! Need a LoF variant list in AllofUs. Filter the giant variant annotation table? Hours of waiting! Ended up copying and pasting data browser results to DeepSeek and in 10 seconds I got a clean tab-separated file! Cool!
English
1
0
4
432
Yuchang Wu
Yuchang Wu@yy_stat·
@SashaGusevPosts Does the environment correlate with genetics? Otherwise, I don't have the intuition why GWAS could be problematic.
English
1
0
0
110
Sasha Gusev
Sasha Gusev@SashaGusevPosts·
Run a GWAS that doesn't control for population stratification. The GWAS will orient all of the drifted alleles towards the environmental differences and the resulting polygenic score will massively exaggerate population genetic differences that do not actually exist.
Sasha Gusev tweet media
English
3
3
33
2.7K
Sasha Gusev
Sasha Gusev@SashaGusevPosts·
How population stratification makes environments look like genes. A short 🧵:
Sasha Gusev tweet media
English
6
78
343
33K
Yuchang Wu
Yuchang Wu@yy_stat·
@geyu_zhou yes; I was too excited to see the job done and totally forgot to copy the results to bucket
English
0
0
0
95
Geyu Zhou
Geyu Zhou@geyu_zhou·
@yy_stat Using dsub and writing to the workspace bucket is a safer approach.
English
1
0
0
71
Yuchang Wu
Yuchang Wu@yy_stat·
pain in using cloud computing: took 1 day 10 hours and 44 minutes to finish some analysis using AllofUs data, and I accidentally deleted the "environment" and all the results are gone forever ... 😭😭😭
English
3
1
3
981
Dajiang Liu
Dajiang Liu@dajiangliu81·
@AllofUsResearch @AllofUsCEO Want to bring this to your attention. We set up a workspace without linking a GCP account. We are doing an extensive computational project. No one notified us anything until yesterday and we were told that our data would be deleted in 24 hours. 1/n
English
5
3
7
2.6K
Yuchang Wu retweetledi
Jiacheng Miao
Jiacheng Miao@Jiacheng_Miao·
#ASHG24 Interested in how to reliably use AI/ML to accelerate genetic discovery? Stop by poster 4151 this afternoon to chat!
Jiacheng Miao tweet media
English
2
8
19
2.2K
Yuchang Wu
Yuchang Wu@yy_stat·
While we didn't explore it in this paper, another tool (POP-GWAS) from our lab (x.com/Q_StatGen/stat…) may offer a better solution when dealing with phenotypes with biases. --- also check it out!
Qiongshi Lu@Q_StatGen

POP-GWAS led by @Jiacheng_Miao is published today in @NatureGenet. This work showcases modern data science leveraging the power of machine learning while maintaining statistical validity. It also provides a GWAS solution when the outcome is ML-predicted nature.com/articles/s4158…

English
0
1
3
287
Yuchang Wu
Yuchang Wu@yy_stat·
Thrilled to see this published on Nature Genetics! We found weird results using AD GWAX in another project and discovered that the bias is actually a very serious issue in the AD field. This underscores the importance of data quality. Larger is not always better!
Qiongshi Lu@Q_StatGen

Our paper on misleading biases in AD GWAS-by-proxy is published @NatureGenet. We identify the source of biases and explore strategies to reduce them @yy_stat @SunZhongxuan Paper📰: doi.org/10.1038/s41588… Software🧑‍💻: github.com/qlu-lab/GSUB Sumstats⬇️: qlu-lab.org/data.html

English
1
5
28
3.1K