Markus Frohmann

68 posts

@FrohmannM

AI Master Student @jkulinz https://t.co/cVbiZvlrxj

Joined February 2020
519 Following · 131 Followers
Markus Frohmann @FrohmannM
Excited to share that I'll be joining @thomsonreuters Labs in Zug, Switzerland, as an Applied AI Scientist Intern from April to September 2026! 🏔️ A nice bridge between finishing my MSc at JKU Linz and starting my PhD later this year - let me know if you're around!
Replies 0 · Retweets 0 · Likes 2 · Views 100
Markus Frohmann @FrohmannM
@usmasfr Cool use case! You can start with our default setup, using 30 epochs, here: github.com/segment-any-te… Otherwise, I'd focus more on clean segmented training data than tuning epochs; >100 segmented training sentences would be good. You can also tune LoRA rank a bit if needed.
Replies 1 · Retweets 0 · Likes 1 · Views 23
usama @usmasfr
@FrohmannM Fine-tuning sat via LoRA to detect scene boundaries in fiction manuscripts. Any tips on optimal training data size and epoch tuning for this kind of domain adaptation? Appreciate the work 🙌
Replies 1 · Retweets 0 · Likes 0 · Views 42
Markus Frohmann @FrohmannM
wtpsplit now supports length-constrained segmentation ✂️ Set a min/max chunk length (in characters) while preserving semantic chunks - should be great for RAG! Example (≤30 chars): [Landing 5pm → Beimen.] [Let's meet at: Ximen Exit 6.] [Then: Ningxia Night Market...] [Late-night snack!]
Markus Frohmann @FrohmannM

Introducing 🪓Segment any Text! 🪓 A new state-of-the-art sentence segmentation tool! Compared to existing tools (and strong LLMs!), our models are far more: 1. efficient ⚡ 2. performant 🔝 3. robust 🚀 4. adaptable 🎯 5. multilingual 🗺

Replies 2 · Retweets 0 · Likes 0 · Views 146
Markus Frohmann @FrohmannM
More info + how to use: github.com/segment-any-te… (section #new-v22-length-constrained-segmentation)
Replies 0 · Retweets 0 · Likes 0 · Views 30
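The length-constrained segmentation described in the post above can be illustrated with a small, library-independent sketch. It assumes the text has already been split into sentence-level segments, and it greedily packs adjacent segments into chunks of at most `max_length` characters. The function name `pack_segments` and its parameters are hypothetical and are not part of wtpsplit's API. wtpsplit's own method may choose boundaries differently, so the linked README remains the reference for actual usage.

```python
from typing import List


def pack_segments(segments: List[str], max_length: int) -> List[str]:
    """Greedily merge adjacent segments into chunks of at most max_length chars.

    Hypothetical helper for illustration; not wtpsplit's API or algorithm.
    """
    chunks: List[str] = []
    current = ""
    for segment in segments:
        if len(segment) > max_length:
            # A real tool would split further; this sketch only merges.
            raise ValueError(f"segment longer than {max_length} chars: {segment!r}")
        candidate = f"{current} {segment}" if current else segment
        if len(candidate) <= max_length:
            current = candidate  # still fits, keep growing the chunk
        else:
            chunks.append(current)  # close the chunk at a segment boundary
            current = segment
    if current:
        chunks.append(current)
    return chunks


# Segments taken from the example in the post above.
segments = [
    "Landing 5pm → Beimen.",
    "Let's meet at: Ximen Exit 6.",
    "Then: Ningxia Night Market...",
    "Late-night snack!",
]

for chunk in pack_segments(segments, max_length=30):
    print(f"[{chunk}]")
```

With `max_length=30` no two neighbouring segments fit together, so each segment stays its own chunk. A larger limit lets neighbours merge, while chunk boundaries still fall only between segments.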
Markus Frohmann @FrohmannM
I'm at #EMNLP2025 in Suzhou this year! Looking forward to connecting with the community after a year's break and spending some time abroad... 再見 (see you)!
Replies 1 · Retweets 0 · Likes 16 · Views 2.3K
Markus Frohmann @FrohmannM
@Deezer @aclmeeting I view this work as an important extension of current single-modality detectors that maintains flexibility and modularity. It's not production-ready, but it highlights key paradigms for detection: using all available information from just the audio, and a focus on robustness.
Replies 1 · Retweets 0 · Likes 0 · Views 90
Markus Frohmann @FrohmannM
Excited to share two new papers on AI-generated music detection from my research internship at @Deezer, published in @ismir_conf #ISMIR2025 and @aclmeeting #ACL2025 Findings! 🎶🤖 The problem: most AI music detectors are impractical or unreliable in real-world settings.
Replies 5 · Retweets 0 · Likes 3 · Views 457
Markus Frohmann retweeted
Benjamin Minixhofer @bminixhofer
We created Approximate Likelihood Matching, a principled (and very effective) method for *cross-tokenizer distillation*! With ALM, you can create ensembles of models from different families, convert existing subword-level models to byte-level and a bunch more🧵
Replies 2 · Retweets 26 · Likes 88 · Views 6.5K
Markus Frohmann @FrohmannM
Curious about our SoTA text segmentation tool? 🪓 It's gonna help you across all kinds of NLP tasks! Learn more at our poster session: Tuesday, 4pm, Jasmine room at #EMNLP2024! 🗓️ See you there! I'll be attending the whole conference - happy to connect with everyone! 👋
Markus Frohmann @FrohmannM

Introducing 🪓Segment any Text! 🪓 A new state-of-the-art sentence segmentation tool! Compared to existing tools (and strong LLMs!), our models are far more: 1. efficient ⚡ 2. performant 🔝 3. robust 🚀 4. adaptable 🎯 5. multilingual 🗺

Replies 0 · Retweets 0 · Likes 5 · Views 238
Markus Frohmann @FrohmannM
Excited to share that I joined @researchdeezer as a research intern to work with @evpure and @Gabolsgabs on detecting AI-generated lyrics! 🎶 The first few weeks have been amazing, and I am excited about what is to come - life in Paris certainly has unparalleled charm!
Replies 0 · Retweets 0 · Likes 6 · Views 204
Markus Frohmann @FrohmannM
This was an awesome summer! I can only recommend ETH's summer research fellowship program 🏔️ Also happy about the project's progress - integrating videos into existing architectures is quite exciting, stay tuned! Super grateful to Ryan Cotterell and @glnmario for supervising me.
Markus Frohmann @FrohmannM

Excited to share that I joined @ETH Zürich as a summer research fellow, supervised by Prof. @ryandcotterell, working on ✨Multimodal LLMs! ✨ The first few weeks have been a blast, and I'm looking forward to the weeks ahead! 📽️

Replies 1 · Retweets 0 · Likes 6 · Views 368
Markus Frohmann retweeted
Cohere Labs @Cohere_Labs
Congratulations to C4AI Research Grant recipient @FrohmannM and all authors of "Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation" for their EMNLP acceptance!🥳
Markus Frohmann @FrohmannM

This got accepted to #EMNLP 24 Main! ✨ Same goes for our paper on unsupervised debiasing 🤓 ✈️🇺🇲

Replies 1 · Retweets 2 · Likes 7 · Views 1.4K