Marco Fahmi

2.7K posts

@dataronin

Mostly lurking.

Brisbane, Australia · Joined January 2011

913 Following · 411 Followers

Marco Fahmi retweeted

LSE Impact Blog @LSEImpactBlog · 26 May

✍️ Dorothea Strecker, Heinz Pampel, Rouven Schabinger and Nina Leonie Weisweiler explore how common data repository shutdowns are and suggest what can be done to ensure long-term data preservation. #OpenData wp.me/p4m9em-cPR

2.9K

Marco Fahmi retweeted

Jeni Tennison @JeniT · 26 May

I am really troubled by the proposal from the Tony Blair Institute for a "National Data Trust" (NDT) for health data, particularly by the idea that the next government might actually go for it. institute.global/insights/polit…

5.2K

Marco Fahmi retweeted

giulio quaggiotto @gquaggiotto · 16 Apr

“When we have a technology that treats something as simple and fundamental as our name as an error, it robs us of our personhood" theatlantic.com/technology/arc…

333

Marco Fahmi retweeted

giulio quaggiotto @gquaggiotto · 26 Dec

Transformative governance of innovation ecosystems sciencedirect.com/science/articl…

772

Marco Fahmi retweeted

National Centre for Youth Substance Use Research @NCYSUR · 15 Dec

Congratulations to team NCYSUR, which is being awarded a 1.5 million dollar @nhmrc Ideas Grant, led by Dr Daniel Stjepanovic, @tianzesun, A/Prof Phong Thai, and Prof. @DavidHammondPhD.

838

Marco Fahmi retweeted

NHMRC CRE on Achieving the Tobacco Endgame @CREtobacco · 15 Dec

Congratulations 🎉 to CRE researcher Dr Carmen Lim @Cwernlim, who has been awarded $660,000 by the @nhmrc to develop a program to understand how pro-vaping campaigns on social media influence young people's attitudes towards the use of e-cigarettes.

875

Marco Fahmi retweeted

Kevin Gee @kevg1412 · 17 Apr

"He's not even a 10x engineer. He's like, 100x, or 1,000x engineer" Gmail creator Paul Buchheit on how Bret Taylor once rewrote Google Maps in a single weekend:

692

9.1K

2.4M

Marco Fahmi retweeted

Axios @axios · 24 Aug

Behind the hype of generative AI, large companies are struggling to deploy the new technology — hitting cost and data management hurdles that are leaving many of their generative AI projects stuck in pilot phase. trib.al/zwRlIVl

12.1K

Marco Fahmi retweeted

Axios @axios · 24 Aug

As adoption of generative AI grows, providers are hoping that greater transparency about how they do and don't use customers' data will increase those clients' trust in the technology. trib.al/tCr3DVd

12.1K

Marco Fahmi retweeted

Rachel Woods @rachel_l_woods · 18 Aug

There's a resurgence of interest in fine tuning LLMs

I've yet to see a successful public use case where fine tuning > prompting.

But here's where I see fine tuning *mattering*:

First, fine tuning is for teaching an LLM specific tasks or behaviors

Not teaching an LLM new knowledge.

For new knowledge, use Retrieval (store your data in an outside database and strategically pull the right chunks in to give the LLM context to your question)

But even in teaching LLMs specific tasks or behaviors - here's the catch...

LLMs are remarkably good at picking up tasks and behaviors from just a good prompt

THIS is what makes LLMs mind blowing after all

So that begs the question. Where is fine tuning actually helpful?

Some use cases I could see developing are teaching LLMs tasks that are exceptionally difficult to describe, or fit into ~10 examples you can add to a prompt.

One way to think about this: if it would take someone a few weeks doing a task to 'master it' instead of being able to read training materials and get the picture...

That *may* be a use case for fine tuning

But proceed with caution

To truly teach an LLM a new behavior or task, you'll need to treat this like a machine learning project, not just throwing examples in and getting magic in return (which it still blows my mind that ChatGPT does this so well for us).

Things like:

- Dataset design

- Training and test data

- Overfitting

+ more as the tooling around fine tuning gets more sophisticated

The other obvious use case is cost. If you can get a super small language model to do a task instead of GPT-4, there's meaningful cost savings there.

And if you're using a language model to do large scale tasks like triaging your customer support inbox, or analyzing public data for insights

The costs can add up.

But if you're wondering where the heck to invest in fine tuning...

My answer at the moment for most businesses is still:

Make sure you can't do it with prompts.

452

164.1K

Marco Fahmi retweeted

Meredith Whittaker @mer__edith · 17 Aug

📢NEW PAPER! Where @davidthewid, @sarahbmyers & I unpack what Open Source AI even is. We find that the terms ‘open’ & ‘open source’ are often more marketing than technical descriptor, and that even the most 'open' systems don't alone democratize AI 1/ papers.ssrn.com/sol3/papers.cf…

618

1.8K

634.9K

Marco Fahmi retweeted

Bellingcat @bellingcat · 7 Aug

New updates on Bellingcat's #github this week. The 'whisperbox' API receives audio or video URLs and returns the video transcripts using OpenAI's Whisper model. Designed by Bellingcat Discord member github.com/fspoettel. Find the tool at: github.com/bellingcat/whi…

67.6K

Marco Fahmi @dataronin · 5 Aug

Opinion Paper: “So what if ChatGPT wrote it?” Multidisciplinary perspectives on opportunities, challenges and implications of generative conversational AI for research, practice and policy sciencedirect.com/science/articl…

Marco Fahmi @dataronin · 29 Jul

Tackling generative AI’s sustainability problem diginomica.com/tackling-gener…

Marco Fahmi retweeted

Daniel Severo @_dsevero · 28 Jul

Best poster award

120

3.3K

28.1K

2.6M

Marco Fahmi retweeted

Global Investigative Journalism Network @gijn · 26 Jul

🗺️ NEW! Can journalists rely on #AI chatbots to successfully geolocate an image? Investigative journalism group @bellingcat conducted 3️⃣ tests to examine the geolocation capabilities of #Bing and #Bard. Reporter @DennisKovtun broke down the findings: ⬇️ buff.ly/3O7PmFu

2.6K

Marco Fahmi retweeted

Peter Griffin @petergnz · 26 Jul

Department of Internal Affairs has put out some guidance today on use of generative AI in the public sector...

8.9K

Marco Fahmi @dataronin · 23 Jul

What will the federal government do with generative AI? nextgov.com/artificial-int… via @Nextgov

Marco Fahmi retweeted

Owen Boswarva @owenboswarva · 21 Jul

US judge finds flaws in artists' lawsuit against #AI companies reuters.com/legal/litigati… #StableDiffusion #genAI #AIlaw #IPlaw #webscraping #openweb Andersen et al v. Stability AI Ltd. et al dockets.justia.com/docket/califor… + courtlistener.com/docket/6673212…

394

Marco Fahmi retweeted

Justin Alvey @justLV · 18 Jul

I “jailbroke” a Google Nest Mini so that you can run your own LLMs, agents and voice models. Here’s a demo using it to manage all my messages (with help from @onbeeper) 🔊 on, and wait for the surprise guest! I thought hard about how to best tackle this and why, see 🧵

366

2.5K

13.9K

1.7M
