Miha Jenko

108 posts

Miha Jenko banner
Miha Jenko

Miha Jenko

@miha_jenko

SWE, ML & DS professional. #NLProc specialist. Opinions expressed are not my employer's. RTs don't count as endorsements.

Ljubljana, Slovenia Katılım Mayıs 2022
49 Takip Edilen10 Takipçiler
Miha Jenko retweetledi
Max Hager
Max Hager@yachty66·
✨Introducing GPU Benchmark Tool A simple tool to test the health and performance of your NVIDIA GPU and see its performance. Let's be honest, FurMark and 3DMark suck 😂 and they are not really testing AI performance.
English
5
4
38
8K
Miha Jenko retweetledi
Perry E. Metzger
Perry E. Metzger@perrymetzger·
Summary: Huawei trained a fairly big AI model entirely on their own hardware, with no Western GPUs. On the one hand, this shouldn’t be shocking to anyone. On the other hand, I know lots of people who claim that this couldn’t happen, so I thought it was worth reposting. (I expect a lot of people in the West will be even more shocked when the Chinese catch up on chip process technology.)
English
1
19
89
18.7K
Miha Jenko retweetledi
Matthew Green
Matthew Green@matthew_d_green·
You should use Signal. Seriously. There are other encrypted messaging apps out there, but I don’t have as much faith in their longevity. In particular I have major concerns about the sustainability of for-profit apps in our new “AI” world.
English
18
102
570
51K
Miha Jenko retweetledi
Hamel Husain
Hamel Husain@HamelHusain·
. @sh_reya 's paper confirms what I see in practice 1) Automated evals don't work (without semi-manual human alignment) 2) Most tools don't provide this alignment 3) Automated evals add mostly noise 4) You can only write good evals by looking at data and reacting to failures
Hamel Husain tweet mediaHamel Husain tweet mediaHamel Husain tweet mediaHamel Husain tweet media
English
19
79
708
76.1K
Miha Jenko
Miha Jenko@miha_jenko·
@LeonDerczynski Researches can fall into the trapping offered by step-wise learning robustness. Since the algorithms are robust to lone errors in the training data, we develop habits to follow up on the training error whilst failing to apply the same rigor on test data too.
English
0
0
0
38
Leon Derczynski ✍🏻 🌞🏠🌲
Why are NLP benchmarks consistently so poor quality? MT-Bench' resason, math, and coding eval instances were wrong in *25%* of cases Utter disaster. Easy fix: Look at your data with your eyes and be prepared to reject it
Inflection AI@inflectionAI

Evaluation is everything! While testing Inflection-2.5, we found that MT-Bench has a bunch of incorrect answers. Here we share the corrections for everyone to use, and we release a new Physics GRE benchmark for people to try out. inflection.ai/inflection-2-5

English
2
0
20
2.8K
Miha Jenko
Miha Jenko@miha_jenko·
@mtrc We are also going to need lossless video-audio on the internet, because these days it's hard to trust a low-res video with bleepy audio to not be AI-generated.
English
0
0
1
548
mike cook
mike cook@mtrc·
dude's got it bang on. the one thing we're definitely not going to need, in a world increasingly full of computers, is people who understand computers.
English
14
40
833
50.8K
Miha Jenko retweetledi
Rachel Thomas
Rachel Thomas@math_rachel·
Hype around AI in medicine often ignores two key risks: - patterns of automation contribute to centralization of power - medical knowledge is limited by the systemic refusal to trust patient expertise 1/ my new post: rachel.fast.ai/posts/2024-02-…
English
8
71
241
40.5K
Miha Jenko retweetledi
dr. jack morris
dr. jack morris@jxmnop·
today i found out that this one australian guy has been toiling away making incredibly detailed Neural Circuit Diagrams with the vibe of a 1950s issue of Popular Mechanics, but content fit for the 2020s behold. the Transformer
dr. jack morris tweet media
English
23
585
4.9K
435K
Miha Jenko
Miha Jenko@miha_jenko·
@lvwerra This only works for you because you have some context already.
English
0
0
0
52
Stella Biderman
Stella Biderman@BlancheMinerva·
Does anyone have recommended tools / templates for creating some of the standard diagrams in transformer papers like the ones below?
Stella Biderman tweet mediaStella Biderman tweet mediaStella Biderman tweet media
English
63
27
388
182.9K
Miha Jenko retweetledi
Jim Fan
Jim Fan@DrJimFan·
A fact worth highlighting: NVIDIA is making its own *CPU*, and will increasingly excel at it. To max out GPU's performance, building CPU in-house is an inevitable path. Below is GH200, the first superchip that includes all home-grown components: CPU (Grace), GPU (Hopper), and NVLink Chip-2-Chip (C2C) interconnect (delivers up to 900 GB/s total bandwidth). On GH200, CPU and GPU can access each other's memory freely and concurrently. Whitepaper: resources.nvidia.com/en-us-grace-cp…
Jim Fan tweet media
English
59
327
2K
606.4K
Miha Jenko retweetledi
Kevin Fischer
Kevin Fischer@kevinafischer·
I don't talk much about this - I obtained one of the first FDA approvals in ML + radiology and it informs much of how I think about AI systems and their impact on the world. If you're a pure technologist, you should read the following: There's so much to unpack for both why Geoff was wrong, and why his future predictions should not be taken seriously either. Geoff made a classic error that technologists often make, which is to observe a particular behavior (identifying some subset of radiology scans correctly) against some task (identifying hemorrhage on CT head scans correctly), and then to extrapolate based on that task alone. The reality is that reducing any job, especially a wildly complex job that requires a decade of training, to a handful of tasks is quite absurd. Here's a bunch of stuff you wouldn't know about radiologists unless you built an AI company WITH them instead of opining about their job disappearing from an ivory tower. (1) Radiologists are NOT performing 2d pattern recognition - they have a 3d world model of the brain and its physical dynamics in their head. The motion and behavior of their brain to various traumas informs their prediction of hemorrhage determination. (2) Radiologists have a whole host of grounded models to make determinations, and actually, one of the most important first order determination they make is whether there is anything notably wrong with a brain structure that "feels" off. As a result, classifiers aren’t actually performing the same task even as radiologists. (3) Radiologists, because they have a grounded brain model, only need to see a single example of a rare and obscure condition to both remember it and identify it in the future. This long tail of rare conditions to avoid missing is a large part of their training, and no one has any clue how to make a model that acts similar in this way. (4) There’s so many ways to make Radiologist lives easier instead of just replacing them, it doesn’t even make sense to try. I interviewed and hired 25 radiologists, whose primary and chief complaint was that they had to reboot their computers several times a day. (5) A large part of the radiologist job is communicating their findings with physicians, so if you are thinking about automating them away you also need to understand the complex interactions between them and different clinics, which often are unique. (6) Every hospital is a snowflake, data is held under lock and key, so your algorithm might not work in a bunch of hospitals. Worse, the imagenet datasets have such wildly different feature sets they don’t do much for pretraining for you. (7) Have you ever tried to make anything in healthcare? The entire system is optimized to avoid introducing any harm to patients - explaining the ramifications of that would take an entire book, but suffice to say even if you had an algorithm that could automate away radiologists I don’t even know if you could create a viable adoption strategy in the US regulatory environment. (8) The reality is that for every application, the amount of specific and UNKNOWABLE domain knowledge is immense. LONG STORY SHORT: thinkers have a pattern where they are so divorced from implementation details that applications seem trivial, when in reality, the small details are exactly where value accrues. Should you be worried about GPT5 being used to automate vulnerability detection on websites before they’re patched? Maybe. Should you be worried GPT5 is going to interact with SOCIAL systems and destroy our society single-handedly? No absolutely not.
Yann LeCun@ylecun

This must be said and repeated. Yes, Geoff was totally wrong to predict a drop in radiologist positions. We knew that it was wrong when he said it. We have data now.

English
85
566
2.7K
1.6M
Miha Jenko
Miha Jenko@miha_jenko·
@davisblalock d) The conservative position has not been equally reflected in internet data as the liberal position. The scrapped data, not the content filter, skews toward the status quo of the available data.
English
0
0
0
35
Davis Blalock
Davis Blalock@davisblalock·
c) Even with a balanced pool of labelers, there's a liberal vs conservative divide in labeling practices. This divide could come either from cultural differences (e.g., I've rarely heard conservatives use the word "harmful") or bias in the labelers' instructions. [10/11]
English
3
3
53
7.8K
Davis Blalock
Davis Blalock@davisblalock·
Imagine a world where keyboards only let you type sentences that the keyboard manufacturer agrees with. Or where spellcheck and autocorrect work if you're arguing for one side of a debate, but not the other. That's the world we're building with AI services like ChatGPT. [1/11]
Davis Blalock tweet mediaDavis Blalock tweet mediaDavis Blalock tweet mediaDavis Blalock tweet media
English
52
174
814
255.8K
f4mi ‼️
f4mi ‼️@f4micom·
@Ianternn that’s quite reassuring but considering loads of people use unsafe password that are present in dictionaries it still can provide for a quite useful resource for potential malicious actors
English
3
1
165
33K
f4mi ‼️
f4mi ‼️@f4micom·
github.com/ggerganov/kbd-… this tool lets you extract text from an audio recording of keyboard strokes, right now, for free i am not making this shit up, you can potentially steal a password from an audio recording in an office
f4mi ‼️ tweet media
English
112
715
4.7K
1.1M
Miha Jenko
Miha Jenko@miha_jenko·
@matthew_d_green This legislature might not make it in, but it was enough for R&D departments to pay attention and invest.
English
0
0
0
81
Miha Jenko
Miha Jenko@miha_jenko·
@matthew_d_green That's what AI hype does. Pumps up the media and drives legislators and lobbyists into a frenzy. The industry and the academia supports it, of course, because mandates translate to contracts and grants.
English
3
1
5
932
Matthew Green
Matthew Green@matthew_d_green·
The EU’s “chat control” legislation is the most alarming proposal I’ve ever read. Taken in context, it is essentially a design for the most powerful text and image-based mass surveillance system the free world has ever seen.
English
54
1.1K
2.9K
879.7K
Miha Jenko
Miha Jenko@miha_jenko·
Someone mistook the @huggingface logo on my hoodie for Shell's. Too good not to share.
English
0
0
1
44