Bioecomatics

5.6K posts

Bioecomatics

Bioecomatics

@bioecomatic

Applied BiomedicoClinical Informatics BioSystemsInternalMedicine CM2EnvM,OrganSys2Epigenetics,NueOx(aVIZaid2NuerOncoSTx) PsySEMECEpi/Bioecological LangofOrgeff

Tokyo-to, Japan Se unió Şubat 2024
1.6K Siguiendo113 Seguidores
Bioecomatics retuiteado
Marios Georgakis
Marios Georgakis@MariosGeorgakis·
Performance of polygenic risk scores for height, weight, and BMI calculated using European-derived GWASs across UK Biobank (UKBB) ancestries and an Indian population (GenomeIndia, GI)
Marios Georgakis tweet media
English
1
8
22
2K
Bioecomatics retuiteado
Math Hub
Math Hub@mathhub_vn·
Quantum Evolution
English
0
4
24
2.6K
Bioecomatics retuiteado
Biology+AI Daily
Biology+AI Daily@BiologyAIDaily·
AlphaFold Database expands to proteome-scale quaternary structures 1 AFDB is extended beyond monomers by adding 1,754,242 high-confidence predicted protein complexes (primarily homodimers), enabling proteome-scale access to quaternary structure models with standardized formats, metrics, and metadata for search/visualization/bulk use. 2 The study runs a large-scale prediction campaign covering ~31M complexes from 4,777 proteomes: 23,441,822 homodimers derived from UniProt proteomes, plus 7,620,644 heterodimer candidates derived from STRING “physical interaction” links for 16 model organisms and 30 WHO-prioritized global health proteomes. 3 A key methodological contribution is confidence calibration for complex models using interface-focused signals. The authors evaluate multiple metrics and settle on a practical high-confidence rule: ipSAEmin ≥ 0.6, pLDDTavg ≥ 70, and backbone clashes ≤ 10, benchmarked against post-2021 PDB homodimers (positives) and PDB monomers (negatives). 4 Among interface metrics tested (ipTM, ipSAEmin, LISmin, pDockQ2), ipSAEmin provides the clearest separation between true homodimers and monomers and shows a stable F1 plateau up to the chosen cutoff (precision ~0.859, recall ~0.655, F1 ~0.744), motivating its use as the primary AFDB-facing filter. 5 The resulting high-confidence set retains ~7% of predicted homodimers (~1.8M out of ~23M), and AFDB further labels entries by ipSAEmin into “very high-confidence” (≥0.8), “confident” (0.7–<0.8), and “low-confidence” (0.6–<0.7) to help non-expert users interpret interfaces. 6 Compared to experimentally determined multimers in the PDB, the high-confidence predicted complexes increase structural coverage by 1–3 orders of magnitude for most organisms, with especially large gaps being bridged in Metazoa and Viridiplantae (exceptions are heavily studied species like human, E. coli, and yeast). 7 Dataset consistency is assessed at scale: after clustering sequences at 98% identity/95% coverage and aligning complexes within clusters, 95.9% of aligned complexes show complex TM-score > 0.8; additionally, chain A vs chain B within homodimers yields 98.81% with TM-score > 0.8, supporting internal coherence of the predictions. 8 Prediction yield varies across taxonomy: archaea and bacteria show >3× higher high-confidence homodimer rates than eukaryotes, consistent with shorter/more compact prokaryotic proteins and a higher prevalence of homo-oligomeric assemblies, while eukaryotic proteins are often longer, multi-domain, and more disordered and may preferentially form heteromers. 9 For heterodimers, applying the same thresholds produces 56,956 “tentatively high-confidence” models from 7,620,644 STRING-derived candidates. High-confidence likelihood correlates with STRING score, but also with homodimer-like properties (higher inter-chain sequence identity and smaller chain-length differences), motivating further calibration tailored to heteromer biology. 10 Structural clustering of 1,811,201 predicted complexes (high-confidence homodimers + tentative heterodimers) compresses the space ~8-fold into 224,862 clusters; complex topology is highly recurrent (top 1% of non-singleton representatives cover ~25% of entries; top 20% cover ~82%), and ~9% of non-singleton clusters span multiple superkingdoms, suggesting conserved complex building blocks across deep evolution. 11 Case studies highlight “emergent” structures only visible in oligomeric context: a domain-swapped fold in Dictyostelium Q55DI5 becomes high-confidence only as a dimer; a fungal membrane protein (Atg33) gains a more coherent assembly and clearer membrane boundaries as a dimer; multimer prediction can also refine inter-domain architecture even when monomer predictions are already confident, and can partially rescue low-confidence monomers (e.g., an HTH regulator consistent with dimeric function). 12 Engineering/scaling details matter for feasibility at this size: MMseqs2-GPU MSAs (best hit per taxon as an orthology-like filter), AlphaFold-Multimer inference via accelerated ColabFold/OpenFold (TensorRT + cuEquivariance), batching strategies to reduce recompilations, and a post-processing pipeline producing AFDB-compliant mmCIF/BCIF plus computed interface metrics and clash scores. 📜Paper: biorxiv.org/content/10.648… #AlphaFold #ProteinComplexes #StructuralBioinformatics #Interactome #ComputationalBiology #Bioinformatics #ProteinStructure #AIforScience #OpenScience
Biology+AI Daily tweet media
English
0
8
31
1.6K
Bioecomatics retuiteado
Institute of Art and Ideas
The idea that the universe is fundamentally mathematical is one of the leading frameworks in modern thought. | iai.tv/articles/mathe… But physicist Martín López Corredoira invites us to reconsider, questioning whether mathematics describes the world as it is, or as we are able to conceive it.
Institute of Art and Ideas tweet media
English
7
10
39
1.5K
Bioecomatics retuiteado
@BioAI_Neuro
@BioAI_Neuro@NeuroAI_Nexus·
Grand Challenges for Predictive Modeling in Small Molecule Drug Discovery chemrxiv.org/doi/full/10.26… In the study. authors defines a set of Grand Challenges for predictive modeling in small-molecule drug discovery, aiming to clarify where computational methods can most meaningfully advance real drug discovery. Rather than providing a broad methodological review, the paper identifies specific scientific and technical barriers that currently limit progress and proposes measurable benchmarks to evaluate future advances. The goal is to align researchers, industry leaders, and investors around problems that have direct impact on drug discovery outcomes. The challenges are organized into four core domains: Chemistry, Structure, Energy, and Pharmacology. For each domain, the authors define key problems, describe the underlying physical and biological principles, explain their relevance to drug discovery decision-making, assess the current state of the field, and propose quantitative metrics for measuring progress. By focusing on well-defined problems rather than general technological trends, the work aims to bridge the gap between rapidly advancing computational tools (including AI, machine learning, robotics, high-performance computing, and quantum computing) and the practical needs of medicinal chemistry and pharmacology. The overarching objective is to catalyze measurable, sustained progress in computational drug discovery rather than short-lived advances driven by enthusiasm or benchmarking alone. #BioAI #AI_drug_discovery
@BioAI_Neuro tweet media
English
0
5
20
1.1K
Bioecomatics retuiteado
Eric Topol
Eric Topol@EricTopol·
Artificial Biological Intelligence (ABI) In a post-Darwinian era of being able to write genomes, the implications—both for good and harm—are profound. In conversation with @AdrianWoolfson on his new book On the Future of Species erictopol.substack.com/p/on-the-futur…
Eric Topol tweet media
English
12
24
111
28.5K
Bioecomatics retuiteado
Eric Topol
Eric Topol@EricTopol·
Intensity of exercise vs volume of physical activity made a difference for lower risks of 8 diseases and all-cause mortality among 96,000 @uk_biobank participants, especially noted for immune-mediated (IMID). VPA-vigorous physical activity academic.oup.com/eurheartj/adva…
Eric Topol tweet media
English
26
85
453
139.3K
Bioecomatics retuiteado
Bioecomatics retuiteado
International Cyber Digest
International Cyber Digest@IntCyberDigest·
🚨‼️ BREAKING: Databricks allegedly compromised in a TeamPCP supply chain attack. Databricks is the leading cloud-based data analytics platform: used by organizations worldwide to manage massive datasets. We notified them last week. They scaled up to investigate. We haven't heard back since.
International Cyber Digest tweet mediaInternational Cyber Digest tweet media
English
36
203
1.2K
186.5K
Bioecomatics retuiteado
Salesforce AI Research
Salesforce AI Research@SFResearch·
As frontier models mature, the hardest enterprise challenges live at the system level. AI Foundry brings together research, customers, and partners to develop and validate new AI capabilities.
English
0
153
1.3K
7.7M
Bioecomatics retuiteado
NEJM
NEJM@NEJM·
Presented at #ACC26: Among patients with atherosclerotic cardiovascular disease, targeting an LDL cholesterol level below 55 mg per deciliter led to a lower 3-year risk of cardiovascular events than targeting a level below 70 mg per deciliter. Full Ez-PAVE trial results: nejm.org/doi/full/10.10… Editorial: Paving the Road toward Targeted Lipid Lowering nejm.org/doi/full/10.10… @ACCinTouch
NEJM tweet media
English
12
265
733
168.6K