Fabián Baptista

746 posts

Fabián Baptista banner
Fabián Baptista

Fabián Baptista

@fbaptista

CTO at @apptim_official. Co founder at @abstractaUS

Montevideo, Uruguay Katılım Şubat 2009
2.8K Takip Edilen4.1K Takipçiler
Fabián Baptista
Fabián Baptista@fbaptista·
@PNCBank_Help Hello, I have my business account blocked, phone support is with high demand so I can't talk with anybody at the bank. We need a business representative phone number to call asap. Can you please provide me a business number to talk with somebody instead of automatic machine?
English
2
0
0
39
PNC Bank Help
PNC Bank Help@PNCBank_Help·
Happy Monday! How can we help today?
English
4
0
4
570
Fabián Baptista
Fabián Baptista@fbaptista·
@Minucha01 @frascafrasca Yo estoy en la misma, y por ahora la respuesta q tengo es: experimenta sin parar nuevas herramientas y nuevas IAs, aprende mas rapido y mas autonomo, puede construir arriba de tecnologias y plataformas 10 veces mas rapido.
Español
0
0
1
50
Minu
Minu@Minucha01·
@frascafrasca Y ahora que hace mi hijo programador junior que busca trabajo ? Terminó hace poco y realmente le gusta 🥺
Español
4
0
0
1.5K
Gonzalo Frasca
Gonzalo Frasca@frascafrasca·
Desde hace un año que tengo varios proyectos de software que intento realizar usando IA. Invariablemente, la IA encontraba un problema que no podía solucionar. Entonces, esperaba unos meses a que saliera un mejor modelo y avanzaba un poco más. Esta semana, luego de un año de
Español
9
15
88
26.3K
Fabián Baptista retweetledi
Carlos Santana
Carlos Santana@DotCSV·
En realidad es muy loco pensar que mañana la mayoría de los desarrolladores del mundo podrán trabajar de forma más eficiente que el día de hoy, por el simple hecho de que una IA se ha actualizado. Diría que nunca habíamos vivido un progreso tecnológico con un ritmo de mejora tan rápido y que tuviera una aplicabilidad tan inmediata. De una semana a otra, actualizas el modelo y pum, todo funciona mejor. Y así en muchísimos sectores.
Español
44
89
911
104.9K
Fabián Baptista
Fabián Baptista@fbaptista·
@frascafrasca Así es! Nuestra industria todavía está dormida. Muchos colegas me dicen “ahora cuando pase el humo/hype de la IA”. Es para analizar. Probaste v0? Recomiendo para arrancar a prototipos e iterar ideas en segundos. Si es algo “choto” puede ya quedar productivo
Español
2
0
7
1.8K
Gonzalo Frasca
Gonzalo Frasca@frascafrasca·
Increíble. Hace tiempo quería prototipar una app. Calculaba que llevaría unos 3 meses de trabajo y costaría entre 20 y 30.000 dólares. La acabo de hacer con IA. Llevó 12 horas y 17 dólares. No 17000. 17 dólares. Agarrense muy fuerte porque es inimaginable el impacto de esto.
Español
24
42
465
56.8K
Fabián Baptista retweetledi
Roger Abelenda
Roger Abelenda@rabelenda·
Had a blast at #CommunityOverCode in Denver! Reconnected with old friends, met new ones, and had great convos. Thanks to the organizers and everyone for an amazing event! Bonus: Loved the engaging talks and feedback on my Skywalking Copilot presentation!
Roger Abelenda tweet mediaRoger Abelenda tweet mediaRoger Abelenda tweet mediaRoger Abelenda tweet media
English
1
2
4
433
Fabián Baptista retweetledi
Abstracta Latam
Abstracta Latam@AbstractaLatam·
Estamos a menos de una semana de este #webinar de @ChiletecOrg que contará con la participación de @fbaptista. Si te gusta la innovación y te interesan las nuevas tecnologías, este evento es para ti. En este hilo 👇 te dejamos la info completa y el link de registro
Abstracta Latam@AbstractaLatam

🚀 ¡Únete a este webinar de @ChiletecOrg y descubre casos de éxito implantando LLMs! 💡 En este evento, y de la mano de Fabián Baptista, podrás conocer distintas experiencias sobre en qué casos de uso se ha aplicado con éxito la nueva tecnología en distintas industrias.

Español
0
2
4
205
Fabián Baptista retweetledi
Jim Fan
Jim Fan@DrJimFan·
I stand corrected: GPT-4o does NOT natively process video stream. The blog says it only takes image, text, and audio. That's sad, but the principle I said still holds: the right way to make a video-native model efficient is to co-develop the streaming codec on edge device. Gemini seems to treat video as first-class citizen. Let's see how Google I/O goes tomorrow.
Jim Fan@DrJimFan

I know your timeline is flooded now with word salads of "insane, HER, 10 features you missed, we're so back". Sit down. Chill. Take a deep breath like Mark does in the demo . Let's think step by step: - Technique-wise, OpenAI has figured out a way to map audio to audio directly as first-class modality, and stream videos to a transformer in real-time. These require some new research on tokenization and architecture, but overall it's a data and system optimization problem (as most things are). High-quality data can come from at least 2 sources: 1) Naturally occurring dialogues on YouTube, podcasts, TV series, movies, etc. Whisper can be trained to identify speaker turns in a dialogue or separate overlapping speeches for automated annotation. 2) Synthetic data. Run the slow 3-stage pipeline using the most powerful models: speech1->text1 (ASR), text1->text2 (LLM), text2->speech2 (TTS). The middle LLM can decide when to stop and also simulate how to resume from interruption. It could output additional "thought traces" that are not verbalized to help generate better reply. Then GPT-4o distills directly from speech1->speech2, with optional auxiliary loss functions based on the 3-stage data. After distillation, these behaviors are now baked into the model without emitting intermediate texts. On the system side: the latency would not meet real-time threshold if every video frame is decompressed into an RGB image. OpenAI has likely developed their own neural-first, streaming video codec to transmit the motion deltas as tokens. The communication protocol and NN inference must be co-optimized. For example, there could be a small and energy-efficient NN running on the edge device that decides to transmit more tokens if the video is interesting, and fewer otherwise. - I didn't expect GPT-4o to be closer to GPT-5, the rumored "Arrakis" model that takes multimodal in and out. In fact, it's likely an early checkpoint of GPT-5 that hasn't finished training yet. The branding betrays a certain insecurity. Ahead of Google I/O, OpenAI would rather beat our mental projection of GPT-4.5 than disappoint by missing the sky-high expectation for GPT-5. A smart move to buy more time. - Notably, the assistant is much more lively and even a bit flirty. GPT-4o is trying (perhaps a bit too hard) to sound like HER. OpenAI is eating Character AI's lunch, with almost 100% overlap in form factor and huge distribution channels. It's a pivot towards more emotional AI with strong personality, which OpenAI seemed to actively suppress in the past. - Whoever wins Apple first wins big time. I see 3 levels of integration with iOS: 1) Ditch Siri. OpenAI distills a smaller-tier, purely on-device GPT-4o for iOS, with optional paid upgrade to use the cloud. 2) Native features to stream the camera or screen into the model. Chip-level support for neural audio/video codec. 3) Integrate with iOS system-level action API and smart home APIs. No one uses Siri Shortcuts, but it's time to resurrect. This could become the AI agent product with a billion users from the get-go. The FSD for smartphones with a Tesla-scale data flywheel.

English
69
89
1K
428.5K
Nick St. Pierre
Nick St. Pierre@nickfloats·
Friendly reminder… Reality is broken.
English
136
1.3K
9.8K
3.4M
Roger Abelenda
Roger Abelenda@rabelenda·
@AndrewYNg We developed and have been using github.com/abstracta/brow… to build copilots for these iterative workflows with users. We see a huge potential in implementing copilots that allow users to interact in more efficient ways with existing web apps.
English
1
2
2
98
Andrew Ng
Andrew Ng@AndrewYNg·
I think AI agentic workflows will drive massive AI progress this year — perhaps even more than the next generation of foundation models. This is an important trend, and I urge everyone who works in AI to pay attention to it. Today, we mostly use LLMs in zero-shot mode, prompting a model to generate final output token by token without revising its work. This is akin to asking someone to compose an essay from start to finish, typing straight through with no backspacing allowed, and expecting a high-quality result. Despite the difficulty, LLMs do amazingly well at this task! With an agentic workflow, however, we can ask the LLM to iterate over a document many times. For example, it might take a sequence of steps such as: - Plan an outline. - Decide what, if any, web searches are needed to gather more information. - Write a first draft. - Read over the first draft to spot unjustified arguments or extraneous information. - Revise the draft taking into account any weaknesses spotted. - And so on. This iterative process is critical for most human writers to write good text. With AI, such an iterative workflow yields much better results than writing in a single pass. Devin’s splashy demo recently received a lot of social media buzz. My team has been closely following the evolution of AI that writes code. We analyzed results from a number of research teams, focusing on an algorithm’s ability to do well on the widely used HumanEval coding benchmark. You can see our findings in the diagram below. GPT-3.5 (zero shot) was 48.1% correct. GPT-4 (zero shot) does better at 67.0%. However, the improvement from GPT-3.5 to GPT-4 is dwarfed by incorporating an iterative agent workflow. Indeed, wrapped in an agent loop, GPT-3.5 achieves up to 95.1%. Open source agent tools and the academic literature on agents are proliferating, making this an exciting time but also a confusing one. To help put this work into perspective, I’d like to share a framework for categorizing design patterns for building agents. My team AI Fund is successfully using these patterns in many applications, and I hope you find them useful. - Reflection: The LLM examines its own work to come up with ways to improve it. - Tool use: The LLM is given tools such as web search, code execution, or any other function to help it gather information, take action, or process data. - Planning: The LLM comes up with, and executes, a multistep plan to achieve a goal (for example, writing an outline for an essay, then doing online research, then writing a draft, and so on). - Multi-agent collaboration: More than one AI agent work together, splitting up tasks and discussing and debating ideas, to come up with better solutions than a single agent would. I’ll elaborate on these design patterns and offer suggested readings for each next week. [Original text: deeplearning.ai/the-batch/issu…]
Andrew Ng tweet media
English
216
1.2K
5.2K
837.3K
Logan Kilpatrick
Logan Kilpatrick@OfficialLoganK·
What is an AI product which doesn’t exist today but you wish did? The more detailed the better 🧵👇🏻 I’ll start: AGI.
English
285
31
440
218.5K
Fabián Baptista retweetledi
AI Notkilleveryoneism Memes ⏸️
@OpenAI @tszzl This is interesting, hadn't heard this before. @ilyasut: "The Open in openAI means that everyone should benefit from the fruits of AI after its built, but it's totally OK to not share the science)"
AI Notkilleveryoneism Memes ⏸️ tweet media
English
41
39
531
126.3K
Fabián Baptista retweetledi
Gaston Milano Millan
Gaston Milano Millan@GMilano·
We have added support for the new Anthropic Claude 3 models. Evaluating assistants with various suppliers and various models is necessary in many projects for reasons of effectiveness and cost efficiency. #GeneXus genexus.com/en/products/ge… @Globant
English
1
8
21
935
Fabián Baptista
Fabián Baptista@fbaptista·
@ealmeida Feliz cumple atrasado. Que la vida te regale una nueva gloria que añorar en los próximos años
Español
0
0
2
22
Enrique Almeida
Enrique Almeida@ealmeida·
Gané la travesía "Trompas de Falopio" hace 57 años y 9 meses. Eran millones de participantes y llegué primero. Desde ese día vivo añorando glorias pasadas.
Español
5
1
15
477
Fabián Baptista retweetledi
David Giordano
David Giordano@3dgiordano·
Simplificando, #GX30 @GeneXus desde 1989 en DOS, Generando para DOS/AS400, luego Windows, múltiples capas cliente/servidor, para web, para mobile, para la nube, para chatbots y para generative AI en 2023. El IDE de desarrollo evolucionó de DOS, Windows, y a la nube en web.
Español
1
5
9
899