Anuda Weerasinghe

339 posts

@anuda_w

audio @googledeepmind

Joined January 2015
969 Following · 145 Followers
Anuda Weerasinghe
Anuda Weerasinghe@anuda_w·
@BikingEddy @GoogleAI it works with any headphones on any Android device. Currently only in the US, Mexico and India - but we're working on expanding very soon.
English
3
0
2
65
Eddy
Eddy@BikingEddy·
@GoogleAI Does it need to be used with google ear buds or can be used with hearing aids connected with the phone by Bluetooth?
English
1
0
0
87
Google AI
Google AI@GoogleAI·
Listen up 🔊 We’ve made some updates to our Gemini Audio models and capabilities:
- Gemini’s live speech-to-speech translation capability is rolling out in a beta experience to the Google Translate app, bringing you real-time audio translation that captures the nuance of human speech
- Gemini 2.5 Flash and 2.5 Pro Text-to-Speech preview models bring improved adherence to style prompts, precision pacing with context-aware speed adjustments, and character voice consistency for multi-speaker scenarios
- Gemini 2.5 Flash Native Audio is now updated, with improvements to handle complex workflows, navigate user instructions, and hold natural conversations
English
222
758
4.1K
1.1M
Anuda Weerasinghe
Anuda Weerasinghe@anuda_w·
@chrisnk14 @GoogleAI yep, if you set the language on the left to "Detect language" it'll translate any language to the language you've selected on the right.
English
1
0
2
55
chris S. Nakamoto
chris S. Nakamoto@chrisnk14·
@GoogleAI This is great. Does it detect the language automatically and translate to your preferred language?
English
1
0
3
1.1K
Anuda Weerasinghe
Anuda Weerasinghe@anuda_w·
@ainativefirm @OfficialLoganK it's currently available on the google translate app on android with any headphones in the us, mexico and india. we're working on bringing it to ios and other regions soon.
English
1
0
3
92
Logan Kilpatrick
Logan Kilpatrick@OfficialLoganK·
Realtime speech to speech translation powered by Gemini, available in Google Translate now, coming to developers early next year : )
English
152
352
3.1K
384.2K
Anuda Weerasinghe
Anuda Weerasinghe@anuda_w·
if you don't have headphones connected, you'll hear translations on your phone speaker for both sides of the conversation. if you have headphones connected, the other person can either read text translations on screen or you can set audio output to your speakers for their language. we're working on making these output settings less confusing though :)
English
1
0
3
117
Jaden Tripp
Jaden Tripp@jadenitripp·
@OfficialLoganK Very cool. Seems like both speakers need the translate app. It would be cool if I could just use the speaker on my phone for both speakers. Like it generates what I said to the other person's language and vice versa.
English
1
0
2
1.2K
Anuda Weerasinghe retweeted
Robby Stein
Robby Stein@rmstein·
1/ Two new features are coming to Lens starting today that let you search with your camera in new ways. These features use our latest AI models in Google Search which are especially good at multimodal tasks. 🧵
English
4
4
30
2.9K
Anuda Weerasinghe retweeted
Ivan Leo
Ivan Leo@ivanleomk·
Using whisper is so 2023. Just use gemini, pass in the raw audio directly and prompt the model directly with all the questions you have. With instructor, we can get
- The exact mispronounced word
- The timestamp when we did it
- Advice on how to do better
Flash truly is the model that keeps on giving @OfficialLoganK
English
8
15
156
90.3K
Anuda Weerasinghe retweeted
Prashant
Prashant@Prashant_1722·
Gemini 1.5 002 beats OpenAI o1-preview on MATH, and it does it at 1/10th the cost and no thinking time. When 2024 started, a lot of people were critical of Google falling behind OpenAI. However, since then they have gathered themselves to pull the right strings, whether it is hardware (chips), software (Pixel) or AI models (Gemini, Gemma, AlphaFold, etc.). Really impressed by how the team at Google DeepMind has outdone themselves month over month to bring superior releases one after the other. Scaling these models with some of the cheapest price points has put them ahead quickly. Excited to see what more is coming. r/singularity u/callmepyro o1-preview Math benchmark in thread.
👩‍💻 Paige Bailey@DynamicWebPaige

"Gemini 1.5 002 beats o1-preview on MATH, and it does it at 1/10th the cost and no thinking time." 🧮🚀 reddit.com/r/singularity/…

English
4
13
183
38.4K
Anuda Weerasinghe
Anuda Weerasinghe@anuda_w·
Salesperson probably gets a commission that's a % of total sale (piecewise comp) - so they're incentivized to sell most expensive version with all the options. If they give you all the options with a breakdown of the specs, you're more likely to select a lower spec with only the options you need. They can probably do better and upsell if they used menu effects to their advantage though. Anyways this kind of info asymmetry is not as much of a problem anymore, you're one search away from all the info you need, even if the search needs to start from a picture of a car you see at the dealership 😉
English
1
0
0
60
Raaid Tanveer
Raaid Tanveer@raaidrt·
Some observations on car sales:
- There's a rule in car dealerships that in order to "get the numbers" i.e. a price quote from car salesmen, the client needs to test drive the car first.
- The salesmen, for some reason unknown to me, are keen on demo-ing just one car and...
English
2
0
1
136
Gram Liu
Gram Liu@gramliu·
As they say, Fall is for new chapters
English
3
0
12
457
James Campbell
James Campbell@jam3scampbell·
I fucking love CMU. Looking through the course catalog, there's like >25 courses that cover LLMs and topics on the frontier of AI. This is what happens when you give Machine Learning, Language Technology, Robotics, etc their own entire departments, as god intended 🫡
English
28
54
1.3K
203.8K
Anuda Weerasinghe retweeted
Mishaal Rahman
Mishaal Rahman@MishaalRahman·
You can now use your voice to add context to searches in Google Lens! Press and hold on the shutter button in Lens, and it'll say "speak now to ask about this image." After speaking your question, let go of the button and Google Gemini will attempt to provide an answer.
English
12
20
254
19.2K
Anuda Weerasinghe
Anuda Weerasinghe@anuda_w·
The tech Twitter and cricket Twitter parts of my feed have converged to the same topic today.
English
0
0
3
103
Anuda Weerasinghe
Anuda Weerasinghe@anuda_w·
@SajithCooray @harshap Tried scanning a UPI qr with a LankaQR enabled app too, but didn't work - seems it hasn't been implemented as part of the initial partnership. Not sure why not.
English
1
0
0
27
Sajith Cooray
Sajith Cooray@SajithCooray·
@harshap @anuda_w Any reason why UPI works in SL while LankaQR does not work in India? (Subject to correction)
Sri Lanka 🇱🇰 English
1
0
0
39
Anuda Weerasinghe
Anuda Weerasinghe@anuda_w·
Yes, you can create a prepaid account (PPI) and use UPI with it, but doing so is a pain as far as I can tell - it's only for g20 countries, and you need to visit a PPI issuer who can "perform money exchange operations" in person to set up the account. Seems revolut is trying to solve the problem, but it's wait-list only for now.
English
0
0
0
59
Harsha Purasinghe
Harsha Purasinghe@harshap·
@anuda_w Response from one of my good friends who was closely involved with UPI "Many things are not true. UPI always works via QR. He needs to sign up with the correct prepaid UPI provider or can even try Revolut in India"
English
2
0
0
137
Anuda Weerasinghe
Anuda Weerasinghe@anuda_w·
Figuring out how to price something like context caching seems like such an interesting problem that requires a pretty deep understanding of the underlying infra and model architecture. Even more interesting to try and infer/reverse engineer the high level infra and model architecture decisions given the pricing...
Logan Kilpatrick@OfficialLoganK

Gemini 1.5 Flash continues to be the best value proposition for anyone building with LLMs.
- $0.0875 / 1 million tokens (cache prompts < 128K)
- $0.175 / 1 million tokens (cache prompts > 128K)
- $1.00 / 1 million tokens per hour (cache storage)
Big 🚢 by @shresbm and team!!

English
0
0
3
319
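
To make the context caching prices quoted above concrete, here is a rough back-of-the-envelope sketch in Python. The three rates come straight from the quoted tweet; everything else is an assumption made for illustration: the function name `estimate_cache_cost`, treating "128K" as 128,000 tokens, charging the lower rate at exactly the threshold, and billing each request for the full cached prefix. It ignores non-cached input tokens and output tokens, so it is not a full bill estimate.

```python
# Rates as quoted in the tweet (USD per 1 million tokens).
CACHED_INPUT_USD_PER_M_SHORT = 0.0875   # cached prompts under 128K
CACHED_INPUT_USD_PER_M_LONG = 0.175     # cached prompts over 128K
STORAGE_USD_PER_M_PER_HOUR = 1.00       # cache storage, per hour

# Assumption: "128K" means 128,000 tokens, and a prompt of exactly that
# size gets the lower rate. The tweet does not specify either detail.
LONG_PROMPT_THRESHOLD = 128_000


def estimate_cache_cost(cached_tokens: int, requests: int, hours_stored: float) -> float:
    """Estimate USD cost of reusing a cached prompt (hypothetical helper).

    Assumes every request reads the whole cached prefix and that storage
    is billed linearly in tokens and hours.
    """
    if cached_tokens < 0 or requests < 0 or hours_stored < 0:
        raise ValueError("inputs must be non-negative")

    if cached_tokens <= LONG_PROMPT_THRESHOLD:
        read_rate = CACHED_INPUT_USD_PER_M_SHORT
    else:
        read_rate = CACHED_INPUT_USD_PER_M_LONG

    millions = cached_tokens / 1_000_000
    read_cost = millions * read_rate * requests
    storage_cost = millions * STORAGE_USD_PER_M_PER_HOUR * hours_stored
    return read_cost + storage_cost


if __name__ == "__main__":
    # 100K cached tokens, read by 10 requests, kept for 1 hour.
    print(f"${estimate_cache_cost(100_000, 10, 1):.4f}")
```

Under these assumptions, storage dominates when a cache sits idle: holding 100K tokens for an hour costs about as much as eleven cached reads of the same prefix, which is the kind of trade-off the pricing seems designed to expose.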