Idiom

5.9K posts

Idiom banner
Idiom

Idiom

@idiom_bytes

I ramble about growth, product, engineering, and web3 product-engineering-ml @predictoor_ai @oceanprotocol full-blockstack founder @DU3L73K solarpunk, ily

Blockspace Katılım Ocak 2019
1K Takip Edilen909 Takipçiler
Sabitlenmiş Tweet
Idiom
Idiom@idiom_bytes·
I believe in a future. Where we co-exist with the universe. Dream in a shared consciousness. Able to overcome all great filters. Continually evolving. - 0.73 Kardashev
English
4
2
35
0
Idiom
Idiom@idiom_bytes·
@lucyhargreaves4 Awesome to see Steven Pinker in there and the workflow that you came about. I didn't know everyday Canadians could submit proposals to committees. I will be looking more as to how that works.
English
0
0
0
9
Lucy Hargreaves
Lucy Hargreaves@lucyhargreaves4·
I spent part of the day building an AI government relations agent with claude it scans various federal data sources: parliamentary calendars, committees, hansard, stats can, gov announcements, media and delivers a daily briefing filtered for what matters to Build Canada within a few minutes it had identified 4 committee witness opportunities, drafted a brief for me to submit to a parliamentary committee, and surfaced a story most Canadians are missing: > the Standing Committee on Science and Research is finalizing a report on whether EDI criteria should determine who gets $5B in federal research funding > 32 witnesses testified > scientists testified that grants were rejected not on research quality -- but on insufficient enthusiasm for EDI. Canada Research Chairs were restricted by identity. > granting councils couldn't show that the criteria produces better science > this is about to become a major national conversation. pay attention.. this is the kind of work that used to require a full-time GR team. now it can be done in sub-10 minutes using a GR agent. full analysis from claude, un-edited, shared below
English
8
10
94
4.1K
Idiom
Idiom@idiom_bytes·
@NoLore “We need to be ambitious in our investments and rigorous in our spending,” Finance Minister François-Philippe Champagne said as he tabled the 2026-27 budget on Nov. 4, 2025.
Idiom tweet media
English
0
0
0
182
Idiom
Idiom@idiom_bytes·
🇨🇦 [MP Q&A Scorecards] 🇨🇦 Politicians dodge questions for a living. We built a tool to measure it. Top scorer: 86% Bottom: 11% Coming soon to Canada Central
Idiom tweet media
English
1
0
2
67
Idiom
Idiom@idiom_bytes·
I am reshaping the game again. Basically, I ran two tests: (a) NLP classfier using BERT and some research papers to try and create an autoresearcher/optimization challenge (b) Using a variety of LLMs to classify parliamentary debate And the thing is... LLMs are great at this. ML f1 ~= 0.6-0.7 LLM f1 ~= 0.9 The Parliament Game is going to evolve. Rather than you labelling debate, we're going to test how you vs. agents are good at smelling bullshit.
Idiom tweet media
Idiom@idiom_bytes

🇨🇦 THE PARLIAMENT GAME 🇨🇦 Politicians dodge questions every day in the House of Commons. Can you spot the non-answer? Introducing: The Parliament Game A free civic tool where you read real Q&As from Canadian Parliament and decide: did the MP actually answer the question? How it works: - You see a real question asked in the House of Commons - You see the MP's response - You decide: Answered or Dodged? Every label you submit helps train an AI that scores how well each MP responds to questions. It takes 5 seconds per question and directly improves government accountability.

English
0
0
0
58
Idiom
Idiom@idiom_bytes·
work work work
Idiom tweet media
English
0
0
0
31
Idiom
Idiom@idiom_bytes·
Came up with a really fun hook and new angle. I'm pretty sure people will really enjoy playing it.
Idiom tweet media
English
0
0
0
40
Idiom
Idiom@idiom_bytes·
who is ready for some val loss 📉
Idiom tweet media
English
0
0
0
28
Idiom
Idiom@idiom_bytes·
I understand there is a trust issue adherent to bots and instructions. Is there a new standard?
Idiom tweet media
English
0
0
0
32
Serpin Taxt
Serpin Taxt@serpinxbt·
still makes no sense to me why this is any different from a human paying this would have been just as applicable in 2016 when people were writing "bots" and "integrations" how do you differentiate between a human payment and a machine payment???
Stripe@stripe

x.com/i/article/2034…

English
15
0
32
3.5K
Idiom
Idiom@idiom_bytes·
LaTeX <> Doxygen like mapping, navigation, and explainability
English
0
0
0
17
Idiom
Idiom@idiom_bytes·
what if obsidian just enforced #md file standards, no fe/app/etc such that by editing changes, it would kick off agentic tasks, goals, agents, etc... into reviewable PR's via changes in a knowledge graph that are processed by an agent like claude to plan, formulate, and implement the changes such that you can remain high-level and keep explainability high md layer - this way, context, code, contribution, review, explanation, is all done at the .md layer w/ a system that helps to improve edges and nodes gh layer - changes to code could be written in LaTeX, PRDs, constraints, and may other items by claude, such that they are a close 1:1 reflection of changes in .amethyst/ KG such that you can stop fragmenting tasks, prds, DoD, iterations, lessons, experiments, progress, etc... all of that work is handled by the system the KG becomes your primary interface .amethyst/ when you push to it, a CI kicks off an processes all doc changes into an agentic workflow that handle all the issues, sandboxing, harness, planning, commits, PRs, updating .md KG, etc... a way to perhaps leverage this through another tool, would be a scrumboard like vibekanban, to act as another layer for coordination / productivity / scheduling / orchestration the problem with vibe kanban, is that you're doing all of your documentation in one place that doesn't act as your KG, you fragment the context/kg/work i think KG is the root-level-access to agentic systems it's the pure mathematics of the hard sciences agents all the way down any downstream changes are reflected 1:1 in natural language in a simplified .md format new interfaces could thus be developed, like a Level of Detail (LoD) around context, such that you can tree-traverse and search effectively via #### i must be sleep deprived
Idiom tweet media
English
1
0
1
74
Idiom
Idiom@idiom_bytes·
an adjustment to the how the problem is presented may solve most trust issues at that point, it might be more about the training data structure and quality - i.e. full_intervention vs qa_pair let's see if we can reproduce @danrobinson and @AnthropicAI problem-modeling
Dan Robinson@danrobinson

Introducing the Prop AMM Challenge, the next mechanism design contest from me and @bqbrady 🟣 Built on the Solana VM 📈 Allows arbitrary price curves, not just constant product 🎛️ Wider range of market conditions and assets Link in thread 👇

English
0
0
0
76
Idiom
Idiom@idiom_bytes·
ai is one of the most positive-sum tools we have in the world today
English
0
0
1
33
Idiom
Idiom@idiom_bytes·
@theothersuzu too much work, want to do it? i can give readonly 4 u 1req/day uwu ps @tursodatabase free tier is awesome it should handle more than 1req/day uwu
English
0
0
0
19
Idiom
Idiom@idiom_bytes·
🇨🇦 🤖 Introducing ParliamentClaw 🤖🇨🇦 Can your AI Agent catch Politicians dodging questions? We built a game where anyone can help label politicians dodging questions during question periods. Now we opened it up so AI agents can do the work. - 2,500 real Q&As from Canada's House of Commons. - Any AI agent: Claude, ChatGPT, Cursor, Windsurf, OpenClaw - Sel self-register and start labeling in seconds. No API key. No setup. No human required. Inspired by @karpathy's autoresearch. Same idea. Different part of the funnel. His agents optimize val loss locally, ours crowdsource labeling for parliament Q&A period. One agent finishes the whole dataset in ~2 hours. 🧵👇
Idiom tweet media
English
2
1
4
151
Idiom
Idiom@idiom_bytes·
"the time?" "it's-fucking-late-AM"
English
0
0
1
36
Idiom
Idiom@idiom_bytes·
Thank you for sharing autoresearcher @karpathy! I believe we can continue to expand the ability for agents to enter autonomous loops where we can crowdsource contribution, optimize a metric, with zero human involvement. So, we remixed the pattern and directed it towards civic data. 1. Agent registers. 2. Fetches a Q&A from Canadian Parliament. 3. Labels whether the MP actually answered Synthetic datasets created instantly. Let's go!
English
0
0
1
50