Maurício Maia

206 posts

Maurício Maia

Maurício Maia

@mmaia

Software & Data Engineer building: 🌎 https://t.co/mXsKJ361fq 👩‍💻 https://t.co/ZpkD4tYqkG

Katılım Temmuz 2007
435 Takip Edilen115 Takipçiler
andrew gao
andrew gao@itsandrewgao·
in this essay i will discuss how tiktok killed mobile gaming👾🎮
andrew gao tweet media
English
197
236
5.1K
1.4M
Maurício Maia
Maurício Maia@mmaia·
@bentossell firecrawl.dev can scrape the docs and turn them into markdown. It has a LangChain integration that should make the second step easy to implement.
English
0
0
1
110
Ben Tossell
Ben Tossell@bentossell·
how do you download app documentation easily? (if you can?) like stripe, notion, posthog, etc I wanna download, then use ai (duh) on them. use case is often giving two lots of docs and saying how do i make these work together
English
22
0
24
18.1K
Greg Kamradt
Greg Kamradt@GregKamradt·
@mmaia oh cool, just for openai? Or others too?
English
1
0
0
285
Greg Kamradt
Greg Kamradt@GregKamradt·
I wanted to build a mini app that alerted me when there were new jobs at OpenAI New openings help you read between the lines of a company's growth The build ended up being bigger than I thought Here's the story: Started wanting to know when there was new jobs, but if you want to know what is new, you need to know what is old. So you need to store the old state (I didn't want to use created_at timestamps). So enter a postgres db (via supabase) But why do it just for OpenAI? I want to see other AI companies too Ok so now we need the concept of a company. I originally attached a job_board_token to each company, but had a hard time matching a stream of job board data with a company. So I separated streams from companies, then made an associations table But wait! Turns out that not all AI companies use the same job board provider. OpenAI uses Ashby, Anthropic used Lever, switched to Greenhouse Ok so now I need to create different fetchers and parsers for each job board But then I obviously wanted to config where notifications went and for specific companies So then I needed an events table, subscribers table that said who was listening for events (with filters) and notifications table But wait! Let's let the system grow, how do you find new job boards to track and scrape? I needed a way to make a proposal (a potential board to match), and confirm that it had valid data. So made an proposal service that just scanned the web for potential matches, then checked them against my job board providers But there isn't just one notification type, you'll have tweets (like the quote here), webhooks, emails, sms, etc. so you need to hook up all those channels Some light LLM work to summarize the job description (only for notifs, not for all jobs) Github actions scrapes 3K job boards every day (mostly tech/AI companies) And so here we are today Up next is to pull out the tools mentioned in each job to make a proper list of tools per companies
Greg Kamradt tweet mediaGreg Kamradt tweet mediaGreg Kamradt tweet media
AI Jobs@AIJobAlert

New Job @ OpenAI Software Engineer, Supercomputing Scheduling • Design, implement, operate job scheduling systems • Interface with researchers on workload requirements • Harmonize job lifecycle with infrastructure • Experience with hyperscale scheduling systems

English
22
5
128
43.6K
vicki
vicki@vboykis·
Free product idea: a search engine that’s actually recommendations, ie I search for a post URL and it returns me “posts like this across the internet”. For example, I want to read more about how software teams function or dysfunction erikbern.com/2023/12/13/sim…
English
2
1
38
6.8K
vicki
vicki@vboykis·
the rest of the owl
vicki tweet media
English
11
12
232
58.1K
Maurício Maia
Maurício Maia@mmaia·
@simonw If it is a batch process, add another step to "reduce" the tags.
English
0
0
0
150
Simon Willison
Simon Willison@simonw·
Anyone got any prompting tricks to help with consistent tagging? Ask an LLM for a JSON array of tags for some content and it will work... but if you repeat that against 100 pieces of content you'll get varying results because each document is considered independent of the others
English
52
16
269
90.1K
Maurício Maia
Maurício Maia@mmaia·
@mhmazur I sense that if a model performs at X for some set of problems, people will try to make it perform at the same level for other sets of problems by just improving the prompt.
English
0
0
0
19
Matt Mazur
Matt Mazur@mhmazur·
@mmaia For example, is there a prompt in the universe of possible prompts that could help PaLM 540B achieve a score of 80 instead of 57? How do we know?
English
1
0
1
54
Matt Mazur
Matt Mazur@mhmazur·
LLM theory question: when evaluating an LLM using benchmarks, how much gain is possible by better prompting? Is there a theoretical limit or might it be possible to see massive improvements simply by using an exceptional prompt?
English
13
1
8
7K
Maurício Maia
Maurício Maia@mmaia·
@borowis @mhmazur There're demonstrated cases where it's possible for GPT-3 to match GPT-4 by just improving the prompt.
Maurício Maia tweet media
English
0
0
1
72
Borys
Borys@borowis·
@mhmazur I don't have quantifiable information, however I don't believe that i.e GPT-3 would be able to perform at GPT-4 level with better prompting with what I've seen and tried so far
English
1
0
1
33
Maurício Maia retweetledi
Avthar
Avthar@avthar·
1/ 🤖 You don’t need a specialized vector database. You just need PostgreSQL. Introducing Timescale Vector: PostgreSQL++ for AI Applications - 📈 3x ANN search performance vs Weaviate, 40%-1,500% boost for pgvector. - ⚙️ No need to learn and manage a separate vector database.
Avthar tweet media
English
12
56
404
129.9K
Logan Kilpatrick
Logan Kilpatrick@OfficialLoganK·
Send me your DALL-E 3 prompts, I’ll share the results here 🧵👩‍🎨
English
759
217
1.8K
889.7K
Maurício Maia
Maurício Maia@mmaia·
@marclou I finally listened to the IH podcast with you. Finding out that B2B is not a founder fit was very relatable.
English
0
0
1
49
Marc Lou
Marc Lou@marclou·
2018: I sold a software I didn't build 2020: I abandoned the startup 2023: It still makes $600/month Passive income is incredible
Marc Lou tweet media
English
21
1
182
36.8K
Maurício Maia
Maurício Maia@mmaia·
📈 And the number o jobs published keep growing like crazy.
Maurício Maia tweet media
English
2
0
0
85