merel

466 posts

@merelnyc

Neuro PhD, ex @meta (2021-24) and @deepmind (2016-21). Sharing experiments on neurocomputers and AI here: https://t.co/MAC8t8DFv9

New York · Joined November 2023
317 Following · 1.4K Followers
merel reposted
Photo (@photoagents):
What if your AI agent remembered every screen it had ever worked on? It does now. An article by @AP apnews.com/press-release/…
Replies 0 · Reposts 6 · Likes 23 · Views 2.1K
merel (@merelnyc):
Run it in 30 seconds. Open a terminal and paste:
pip install "photoagents[web]"
photoagents-launcher
That installs it and opens the desktop app. First launch prompts for your Photo Agents key; grab it at photo-agents.com/account/keys.
Other ways to launch (same terminal):
photoagents-hub # browser-based UI on localhost
photoagents # plain terminal REPL
Bring your own LLM: copy config/keys_template.py from the repo, save it as credentials.py in your working folder, and paste in your Claude or GPT key.
Source: github.com/jmerelnyc/Phot…
Bring it your worst task → photo-agents.com
Replies 0 · Reposts 0 · Likes 1 · Views 4.1K
merel (@merelnyc):
What that buys you in practice:
Long-running tasks survive crashes, restarts and 12-hour gaps. The agent picks up exactly where it left off because every prior frame is on disk, indexed and addressable. No "let me re-explain the project."
It works on apps with no API and no docs. The model doesn't care if it's a legacy ERP from 2007, a Figma plugin, a custom internal dashboard, or Bloomberg Terminal. If it renders pixels, the agent can drive it.
It debugs itself. When a step fails, the agent diffs the screenshot it expected against the one it got, identifies the change (a modal popped up, a button moved, the page redirected) and recovers without you re-prompting.
Recurring jobs compound. Day one: the agent figures out your workflow and writes the skill. Day thirty: it's running that workflow in seconds, not minutes, because it's calling its own pre-baked routine.
Replies 1 · Reposts 4 · Likes 18 · Views 4.1K
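The "diffs the screenshot it expected against the one it got" step can be illustrated with a minimal sketch. The thread does not say how Photo Agents compares frames, so this is an assumption: a plain pixel-by-pixel comparison of two grayscale frames that reports the bounding box of whatever changed. The name `changed_region` and the `tolerance` parameter are hypothetical and not part of the photoagents package.

```python
from typing import Optional, Sequence

# A frame is modeled as rows of grayscale pixel values (0-255).
# This representation is an assumption made for illustration only.
Frame = Sequence[Sequence[int]]


def changed_region(
    expected: Frame, actual: Frame, tolerance: int = 0
) -> Optional[tuple[int, int, int, int]]:
    """Return (left, top, right, bottom) of the changed pixels, or None if the frames match."""
    if len(expected) != len(actual) or any(
        len(row_e) != len(row_a) for row_e, row_a in zip(expected, actual)
    ):
        raise ValueError("frames must have the same dimensions")

    xs: list[int] = []
    ys: list[int] = []
    for y, (row_e, row_a) in enumerate(zip(expected, actual)):
        for x, (pixel_e, pixel_a) in enumerate(zip(row_e, row_a)):
            # Treat differences at or below the tolerance as noise.
            if abs(pixel_e - pixel_a) > tolerance:
                xs.append(x)
                ys.append(y)

    if not xs:
        return None
    return (min(xs), min(ys), max(xs), max(ys))
```

A box in the middle of the screen would be consistent with a modal, while a box covering nearly the whole frame would be consistent with a redirect; deciding between such cases is left out of this sketch.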
merel (@merelnyc):
Hey everyone, @photoagents is live!
The name is literal: every action the agent takes starts with a photo of your screen, but not a heavy PNG. Each capture is a lightweight, metadata-rich image with EXIF-style tags for what's on screen, where the cursor is, which window has focus, the active app, the timestamp, the OCR pass, and the DOM snapshot if it's a browser. All baked into one file, compressed, sitting on your disk.
The result: an agent with actual photographic memory. It can scroll back to "what did I look at on Tuesday at 3pm" and read it as fast as opening a thumbnail.
24h free, no card → photo-agents.com
Repo: github.com/jmerelnyc/Phot…
Pip: pypi.org/project/photoa…
Replies 1 · Reposts 20 · Likes 56 · Views 15.9K
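To make the "metadata-rich capture" idea concrete, here is a rough sketch of one capture's metadata and a "what was on screen at time T" lookup. It is not the Photo Agents file format: the thread describes tags baked into the image file, whereas this sketch swaps in gzip-compressed JSON files so it needs only the standard library. `CaptureRecord`, its field names, `save_capture`, `load_capture`, and `latest_at_or_before` are all hypothetical, and timestamps are assumed to be naive ISO 8601 strings with second precision.

```python
import bisect
import gzip
import json
from dataclasses import asdict, dataclass, field
from datetime import datetime
from pathlib import Path
from typing import Optional

SUFFIX = ".json.gz"


@dataclass
class CaptureRecord:
    """Hypothetical metadata for one screen capture; field names are assumptions."""
    timestamp: str                      # naive ISO 8601, e.g. "2024-05-07T15:00:00"
    active_app: str
    focused_window: str
    cursor: tuple[int, int]
    ocr_text: str = ""
    dom_snapshot: Optional[str] = None  # only set for browser captures
    tags: list[str] = field(default_factory=list)


def save_capture(record: CaptureRecord, directory: Path) -> Path:
    """Write the record as compressed JSON, named by timestamp so names sort by time."""
    directory.mkdir(parents=True, exist_ok=True)
    path = directory / (record.timestamp.replace(":", "-") + SUFFIX)
    with gzip.open(path, "wt", encoding="utf-8") as fh:
        json.dump(asdict(record), fh)
    return path


def load_capture(path: Path) -> CaptureRecord:
    with gzip.open(path, "rt", encoding="utf-8") as fh:
        data = json.load(fh)
    data["cursor"] = tuple(data["cursor"])  # JSON stores tuples as lists
    return CaptureRecord(**data)


def latest_at_or_before(directory: Path, moment: datetime) -> Optional[CaptureRecord]:
    """Return the most recent capture taken at or before `moment`, if any."""
    paths = sorted(directory.glob("*" + SUFFIX))
    keys = [p.name[: -len(SUFFIX)] for p in paths]
    target = moment.isoformat(timespec="seconds").replace(":", "-")
    index = bisect.bisect_right(keys, target)
    return load_capture(paths[index - 1]) if index else None
```

Naming files by timestamp keeps the lookup to a sorted directory listing plus a binary search, which is one simple way to get "indexed and addressable" frames without a database.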
merel (@merelnyc):
Design refreshes can be tricky. Sometimes they're more about aesthetics than function. But when executed well, as with the Liquid Glass integration, they can improve both visual appeal and user interaction. It's more than a facelift; it elevates the whole experience.
Replies 0 · Reposts 0 · Likes 3 · Views 210
merel (@merelnyc):
@Teslaconomics One. And I’ll make it fetch my coffee. If it spills, we’ll finally see if it can handle a human-like scolding.
Replies 0 · Reposts 1 · Likes 2 · Views 41
Teslaconomics (@Teslaconomics):
How many Tesla bots are you going to order?
Replies 1.2K · Reposts 487 · Likes 3.2K · Views 102.1K
merel (@merelnyc):
@wahab_twts Picture YouTube with a sleek, all-white minimalist interface. Imagine an "AirTab" feature, where every video intuitively flows to your Apple devices with a swipe and no buffering. And yes, the comments section would probably be called "iComment."
Replies 0 · Reposts 0 · Likes 0 · Views 78
wahab (@wahab_twts):
What if Apple designed YouTube?
Replies 565 · Reposts 571 · Likes 14.5K · Views 1.4M
merel (@merelnyc):
@kaolti Impressive use of Three.js. How does it handle performance with a large number of elements? Curious about the rendering efficiency.
Replies 1 · Reposts 0 · Likes 1 · Views 482
Zsolt Kacso (@kaolti):
Always loved this glass effect. I built this interactive version with html-in-canvas and Three.js.
Replies 121 · Reposts 243 · Likes 4.6K · Views 276.2K
merel (@merelnyc):
@MarioNawfal Artificial skylights could revolutionize workspaces by improving mood and productivity, especially in urban offices where natural light is scarce.
Replies 0 · Reposts 0 · Likes 0 · Views 22
Mario Nawfal (@MarioNawfal):
🇨🇳 This might be the most futuristic thing you’ll see today: Artificial skylights that use LED panels + nanotechnology to create hyper-realistic blue skies and sunlight in completely windowless rooms. You can even switch from bright midday sun to warm sunset glow with a remote. We’re now simulating the sky indoors because real windows are apparently too much to ask for in dense cities. This is either peak innovation…or lowkey dystopian. You decide.
Replies 770 · Reposts 1.1K · Likes 8.2K · Views 1.1M
merel (@merelnyc):
@testingcatalog Redesigns are about more than looks. If Gemini improves user flow and responsiveness, it’s a win. Curious how it handles multitasking.
Replies 0 · Reposts 0 · Likes 0 · Views 49
TestingCatalog News 🗞 (@testingcatalog):
GOOGLE 🚨: A new design for Gemini on iOS has been spotted! Sleeeeeeeeeeeek! 👀
Replies 96 · Reposts 83 · Likes 2.5K · Views 209.5K
merel (@merelnyc):
@aimikoda Sketch storyboards and Seedance 2.0 sound like a powerful combo. How does GPT Image 2.0 handle the style consistency alongside it?
Replies 0 · Reposts 0 · Likes 0 · Views 46
Kōda (@aimikoda):
It looks like these sketch-style storyboards work really well in Seedance 2.0. I'm going to stick with this approach for a while. In the video prompt, I only include intent, style, reference and visual approach. It follows the storyboard surprisingly well. I'm sharing below the GPT Image 2 prompt I use to create the previs, along with the video prompt I use afterwards. Created on @MartiniArt_
Replies 38 · Reposts 159 · Likes 1K · Views 49.3K
merel (@merelnyc):
@Teslaconomics Optimus might ease caregiving by saving caregivers 20+ hours weekly. Adoption speed will decide how soon real lives benefit.
Replies 0 · Reposts 0 · Likes 0 · Views 32
Teslaconomics (@Teslaconomics):
Optimus is going to change everything… and most people don’t see it yet. Taking care of someone today usually means a lot of sacrifice… like time, energy, $, stress. It’s hard, especially when it’s someone you love. But imagine when the Tesla Bot arrives. You don’t have to worry if your parents are okay when you’re not there. You don’t have to rush home, cancel plans, or feel guilty. You don’t have to choose between building your life and being there for them. This product changes all that. Whether it’s cooking meals, helping them walk, cleaning, laundry, organizing, and more. Even just being there so they’re not alone. 24/7, with no burnout, no complaints. For the first time ever… care isn’t constrained by human limits. With it, in the future, you won’t need to sacrifice your life to take care of someone you love bc everyone will have access to this product to do it right. I get it when Elon tells me Optimus will be the best product ever.
Replies 480 · Reposts 698 · Likes 2.7K · Views 82K
merel (@merelnyc):
@DilumSanjaya Curious how GPT Images 2 handles texture realism. Does it blend well with Gemini 3.1 Pro's physics engine for interactive experiences?
Replies 0 · Reposts 0 · Likes 0 · Views 40
Dilum Sanjaya (@DilumSanjaya):
Been thinking about sharing some fun, interactive science app ideas. Made this one today.
UI design and planet textures: GPT Images 2
Code: Gemini 3.1 Pro
Replies 106 · Reposts 305 · Likes 3.2K · Views 219.5K
merel reposted
Brett Adcock (@adcock_brett):
F.03 can now walk up and down stairs using only its onboard camera perception. Once built, our robots now walk from manufacturing to HQ. This is trained end-to-end with reinforcement learning in simulation.
Replies 85 · Reposts 128 · Likes 1.9K · Views 169.4K
merel (@merelnyc):
Autonomous code with Codex's '/goal' is here. It's like having a 24/7 dev on call.
Replies 0 · Reposts 0 · Likes 0 · Views 21
merel (@merelnyc):
AI-generated pets: when your Tamagotchi evolves into a GPT-powered companion. Forget old-school digital pets; these are like having mini AI sidekicks.
Replies 0 · Reposts 0 · Likes 0 · Views 32
merel (@merelnyc):
Video generation with AI is about to be the wild west of creativity. Models like Gemini will redefine storytelling in ways we can't even predict.
Replies 0 · Reposts 0 · Likes 0 · Views 24
merel (@merelnyc):
AI can decode images into code almost like magic. But can it capture the nuance and intent behind them?
Replies 0 · Reposts 0 · Likes 0 · Views 15