Data For Science

7.4K posts

Data For Science banner
Data For Science

Data For Science

@data4sci

Take Control Of Your Data. Join our Data Science Briefing newsletter for the best in #DataScience and #MachineLearning

Manhattan, NY Katılım Şubat 2015
461 Takip Edilen1.5K Takipçiler
Sabitlenmiş Tweet
Data For Science
Data For Science@data4sci·
Our latest Substack post is now live Graph Neural Networks 101 Come learn the the basics of how GNNs work under the hood, with a fascinating new dataset open.substack.com/pub/data4sci/p…
English
0
2
1
158
Data For Science
Data For Science@data4sci·
[2605.22763v1] Advancing Mathematics Research with AI-Driven Formal Proof Search — A clear overview of how AI-guided formal proof search can support real math research, not just toy benchmarks. Useful if you’re tracking where automated… arxiv.org/abs/2605.22763…
English
0
0
0
41
Data For Science
Data For Science@data4sci·
The Science of Unpredictability | Los Alamos National Laboratory — A clear, accessible look at how the FPUT problem helped spark chaos theory and modern nonlinear dynamics. Useful context for anyone working… lanl.gov/media/publicat…
English
0
1
1
75
Data For Science
Data For Science@data4sci·
After Automation | Every — A useful counterpoint to the “AI will eliminate work” narrative: as automation improves, it often shifts effort into new human tasks rather than removing it. Worth reading if you’re thinking about how roles and… every.to/p/after-automa…
English
0
1
1
50
Data For Science
Data For Science@data4sci·
[2605.06394] Lecture Notes on Statistical Physics and Neural Networks — Clear lecture-style notes connecting statistical physics tools to neural network theory—useful as a structured refresher if you want the big ideas and standard derivations… arxiv.org/abs/2605.06394
English
0
25
172
7.1K
Data For Science
Data For Science@data4sci·
[2605.12460] Multi-Stream LLMs: Unblocking Language Models with Parallel Streams of Thoughts, Inputs and Outputs — Interesting take on making LLMs less sequential by running parallel streams for reasoning, inputs, and outputs. Worth a skim if… arxiv.org/abs/2605.12460
English
0
1
1
61
Data For Science
Data For Science@data4sci·
Global approaches to infectious disease surveillance and modeling | Nature Medicine — Clear overview of why infectious disease surveillance is getting harder—and how federated analytics could unlock cross-border modeling… nature.com/articles/s4159…
English
0
0
0
31
Data For Science
Data For Science@data4sci·
Persuading large language models to comply with objectionable requests — Worth a read; the useful bits are usually in the examples and edge cases. pnas.org/doi/10.1073/pn…
English
0
0
0
38
Data For Science retweetledi
Andrej Karpathy
Andrej Karpathy@karpathy·
Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.
English
7.9K
11.1K
149K
27.1M
Data For Science
Data For Science@data4sci·
[2604.09839] Steered LLM Activations are Non-Surjective — Useful if you’re thinking about activation steering as a general-purpose control knob: this paper argues the mapping from steering directions to reachable behaviors isn’t onto, so some… arxiv.org/abs/2604.09839
English
1
0
2
74
Data For Science
Data For Science@data4sci·
Project Glasswing: what Mythos showed us — A clear-eyed look at what happened when security-focused LLMs were aimed at real production code—what they caught, what they missed, and what needs to change before this approach can… blog.cloudflare.com/cyber-frontier…
English
0
0
0
41
Data For Science
Data For Science@data4sci·
Project MUSE -- Verification required! — Heads-up: this Project MUSE link is behind a verification check, so you may need to complete a quick challenge to access it. If you’re doing text/data mining, it points you to Customer Service for help. muse.jhu.edu/article/936213
English
0
0
0
33
Data For Science
Data For Science@data4sci·
[2605.05242] Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction — Useful reframing of retrieval for agentic search: instead of only ranking by semantic similarity, it argues for more direct,… arxiv.org/abs/2605.05242
English
0
0
0
41
Data For Science
Data For Science@data4sci·
[2605.01048] Compared to What? Baselines and Metrics for Counterfactual Prompting — Useful framing for evaluating counterfactual prompting: it pushes you to define the right baselines and metrics before drawing conclusions. Worth a skim if… arxiv.org/abs/2605.01048
English
0
0
0
40
Data For Science
Data For Science@data4sci·
Natural Language Autoencoders Produce Unsupervised Explanations of LLM Activations — Interesting approach to interpretability: train a verbalizer and reconstructor so a short text description has to preserve enough… transformer-circuits.pub/2026/nla/index…
English
0
0
0
45
Data For Science
Data For Science@data4sci·
[2605.06647] Superintelligent Retrieval Agent: The Next Frontier of Information Retrieval — A concise arXiv overview of “superintelligent” retrieval agents and what they could change about search beyond ranking documents. Useful if you’re… arxiv.org/abs/2605.06647
English
0
0
0
67