FlyingIkki

7.7K posts

FlyingIkki

@FlyingIkki

Katılım Nisan 2018

2.5K Takip Edilen336 Takipçiler

FlyingIkki retweetledi

Unreal Engine@UnrealEngine·1d

Unreal Engine 5.8 ships today with experimental MCP server support: Your sources, your pipeline and your workflow—simply configure the MCP plugin and connect to any agent. Get familiar with the MCP server and the PCG Primitive Plugin today and see what teams can build together: epic.gm/ue-5-8-blog

English

215

767

6.5K

2.3M

FlyingIkki retweetledi

Chainlink@chainlink·1d

NEW: Top-10 crypto exchange with 120M+ users, @okx, adopts Chainlink to unlock the $80 trillion tokenized RWA opportunity on X Layer. Chainlink enables devs to create advanced apps, bringing the agentic economy & high-speed DeFi to Chainlink Scale member @XLayerOfficial.

English

282

1.1K

107.8K

FlyingIkki retweetledi

Michaël van de Poppe@CryptoMichNL·17h

Slowly, but surely, $LINK is growing and growing. Another massive partner added to the ecosystem, it's @okx. Very happy to see this news between the two parties, and very bullish for the ecosystem as a whole.

Chainlink@chainlink

English

421

54.8K

FlyingIkki retweetledi

ÖRR Blog.@OERRBlog·20h

Elon Musk ist ein Trottel und Faschist, Tesla Fahrer sind Arschlöcher, Vergleich von Tesla Aktionären mit dem Inzest-Verbrecher Josef Fritzl. Jan Böhmermann arbeitet für das ZDF. #OerrBlog

Deutsch

593

940

5.5K

121.4K

FlyingIkki retweetledi

vittorio@IterIntellectus·6h

this is actually incredible a full body ultrasound scanner that takes 60 seconds instead of spending an hour in an MRI tube, without radiation, hospitals or a $2000 bill soon you’ll just walk into a health spa, order a coffee, step into the pod, and walk out with a 3D map of your body the future is finally starting to look like the future

Midjourney@midjourney

A technical dive inside our new "Midjourney Scanner"

English

148

690

9.1K

652.9K

FlyingIkki retweetledi

Hassan@nutlope·20h

This model is insane at design. I asked GLM 5.2 (left) and Opus 4.8 (right) to build me a landing page and you can't even tell the difference. GLM cost $0.06 while opus cost $0.49. More than 6x cheaper while being faster + more token efficient. Another win for open source AI.

Z.ai@Zai_org

Introducing GLM-5.2: Frontier Intelligence, Open Weights - Significant improvements in coding and agentic tasks - Strong long-horizon capabilities with a 1M context window - Two levels of reasoning effort: GLM-5.2 (max) pushes the limits, while GLM-5.2 (high) strikes a strong balance between performance and token efficiency - MIT-licensed open weights - Same API pricing as GLM-5.1 Tech Blog: z.ai/blog/glm-5.2 Weights: huggingface.co/zai-org/GLM-5.2 API: docs.z.ai/guides/llm/glm… Coding Plan: z.ai/subscribe Chat: chat.z.ai

English

250

378

5.8K

860.8K

FlyingIkki retweetledi

Chubby♨️@kimmonismus·6h

The Midjourney medical thing is genuinely strange and I kind of love it. The plan is a spa. Hot tubs, saunas, cold plunges, open 24/7, first location in San Francisco in 2027. You step into a shallow pool of water, sink slowly through a ring of half a million tiny ultrasonic sensors, and in about 60 seconds you walk out with a 3D map of your insides down to a fraction of a millimeter. No magnets, no radiation, no contrast, just sound waves and warm water. Compare that to how we do this now: They say it's close to 100x faster than an MRI ("60 seconds"). For context, a normal MRI in the US averages around $1,300 and the scan alone can take over an hour inside a loud metal tube. A full-body scan from Prenuvo runs about $2,500 for roughly the same hour. Midjourney wants to flip the whole feeling of it. Build a place you'd want to visit even if there were no scanner, then collect the health data as a side effect. I have no idea yet if the tech delivers what they claim. But the framing is smart. The hardest problem in preventive health has always been getting people to actually show up, and a spa solves that better than a hospital ever will.

Midjourney@midjourney

Announcing a new division of Midjourney called "Midjourney Medical"

English

815

73.7K

FlyingIkki retweetledi

Alok@analogalok·18h

Google's Gemma 4 26B A4B QAT hits 25+ tokens/sec and 320+ tokens/sec prefill on 8 GB VRAM (RTX 4060) + 16 GB RAM using TurboQuant Prefill just went from 200 → 320+ tok/s on the same 8GB card. 1.6x, no new hardware, no new quant, just a KV cache trick stacked on top of the Gemma 4 26B MoE setup from a few days ago. A few days ago I posted Gemma 4 26B A4B hitting 28 tok/s decode on 8GB VRAM using native MTP. prefill was stuck around 200 tok/s. fair callout by the community. So today I tested something I'd already been meaning to try: TheTom/llama-cpp-turboquant, the TurboQuant KV cache fork by Tom Turney (@no_stp_on_snek). (github link in the comments) thanks to him, the fork just got resynced to mainline, so MTP + TurboQuant now run together cleanly (I didnt see any meaningful gains by using MTP with this setup though but you can try). The flags (No MTP): -m gemma-4-26B-A4B-it-qat-UD-Q4_K_XL.gguf -cnv -c 64000 --cache-type-k q8_0 --cache-type-v turbo3 Results on the same RTX 4060 8GB, tested with a 27k token prompt at 64k context loaded: Prefill: 200 tok/s → 320+ tok/s Decode: stayed above 25 tok/s (without MTP) Why it works: TurboQuant uses walsh hadamard rotation + polar quantization on the KV cache. keys are sensitive to compression, values aren't much, so it splits the difference: K stays at q8_0, V drops to turbo3 (~3 bits). bonus from the memory savings: same 8GB card can now stretch to 100-120k context with minimal decode penalty. It should now be snappier with any agent harness such as hermes agent without compromise on intelligence. If you're already running Gemma 4 on a small card, this stacks on top for free. Try --cache-type-k q8_0 --cache-type-v turbo3 on your setup and report back what your prefill/decode split looks like. unsloth model gguf and llama.cpp turboquant fork links in the comments. what's your prefill number before vs after?

Alok@analogalok

Run Gemma 4 26b MTP on 8 GB VRAM GPUs at 25+ tokens/second. Flags included! local llm space is moving at terminal velocity. only 3 days ago google released gemma 4 26b a4b qat quants. more efficient than before, ran on 8gb vram at 20 tok/sec. and now just a few hours ago, mainline llama.cpp merged a massive update and we just shattered our own record. decode throughput went 25-40% up on the same 8 GB VRAM setup! Before MTP: 20 tps -> After MTP: 28 tps! llama.cpp just officially merged PR #23398 ("add Gemma4 MTP"), bringing native Multi-Token Prediction (MTP) support to Gemma 4 models. By running speculative drafting on the same 8GB VRAM RTX 4060 setup, my decode throughput on a 64k context instantly leaped to a blistering 25–27 tokens/sec thats 25-30% increase with the same hardware. Here is the architectural catch you need to know: Unlike the Qwen 3.5 and 3.6 series, which bake the MTP heads directly into the base GGUF, the Gemma 4 MTP head is not built in. You must download a separate, specialized MTP drafter GGUF (the assistant model) to act as the speculator. (I've dropped the download link in the replies). copy and try the exact flags: -m gemma-4-26B-A4B-it-qat-UD-Q4_K_XL.gguf --spec-type draft-mtp --spec-draft-n-max 6 --spec-draft-p-min 0.7 --spec-draft-model gemma-4-26b-A4B-it-assistant-Q4_0.gguf -c 64000 -v n-max 4 and p-min 0.7 is also worth checking out. benchmark on your setup and workflow. if you have a single 8 gb vram nvidia rtx 4060, 3060, 3070, 2080, 2070, grab the MTP drafter GGUF link in the comments and try it yourself. Check it out even if you have asmaller or a larger gpu, such as a single rtx 3090, 4090, 3060, 2060. MTP works for all gemma 4 sizes such as gemma 4 12b, gemma 4 31b etc. but remember to grab the correct mtp draft assistant models respectively. what are you benchmarking today

English

456

44.4K

FlyingIkki retweetledi

Google Gemma@googlegemma·18h

Teamwork makes the dream work. Now running locally. Watch Gemma 4 26B orchestrate 10 parallel sub-agents to code an SVG art gallery in seconds. Hitting 100+ tokens/sec, imagine how you can scale this for complex tasks or local chatbots for entire teams!!

English

129

1.6K

110.1K

FlyingIkki retweetledi

Larissa Fußer@larissafusser·23h

uuuuuuund es geht schon wieder los! Liebe Antifa, da Ihr uns jetzt ja offenbar beobachtet: Ihr erkennt uns an den sitzenden Klamotten, frisierten Haaren und am frisch geduschten Geruch. Liebe Grüße

Max Mannhart@maxmannhart

Es geht in die nächste Runde: Mit einer Kundgebung vor unserem Redaktions-Büro möchte uns die linke Szene aus der Stadt vertreiben. Illustriert wird der Aufruf mit einem Flugzeug, das vor unserer Hausfassade einschlägt - es ist das Foto eines Flugzeugs, das der Pilot einst in ein Stadion in Baltimore steuerte. Was auch immer die friedliebende Zivilgesellschaft damit sagen will. Eine Fahrraddemo will unsere neuen Redaktionsräume markieren und danach unsere ehemaligen Standorte sowie die Kollegen von @niusde_ besuchen und einschüchtern. Ich finde es toll, dass wir uns auf die PR-Kraft der Szene verlassen können. Danke an alle Fans und den polizeilichen Schutz! Übrigens, liebe Genossen, wir sind nicht umgezogen, weil ihr uns erfolgreich vertrieben habt. Wir haben ein neues, wesentlich größeres Büro, weil uns immer mehr Menschen lesen, schauen und mögen. Diesen Menschen fühlen wir uns verpflichtet. Und wir lassen uns ganz sicher nicht von irgendwelchen zugezogenen Soziologie-Studenten im vierzehnten Semester von unserem Weg abbringen.

Deutsch

216

24K

FlyingIkki retweetledi

Avid@Av1dlive·1d

in 9 minutes on the Tokyo stage, Angela Jang, head of product for the Claude platform, said the part most builders still haven't figured out: "a model is only as good as the context that you actually give it." that's the whole game now. not the prompt. the context. what your agent remembers. what skills it can pull. what it learns from its own past runs. anthropic now ships all three: memory, skills, and dreaming. agents that inspect their own trajectories and self-improve. demoed live this week. the prompt was never the bottleneck. the context was. most people are still tweaking words in a chat box. the ones who get this engineer the context their agents run on. watch the 9 minutes. then read the full breakdown below ⬇️

Rahul@sairahul1

x.com/i/article/2067…

English

109

29.7K

FlyingIkki retweetledi

Julian Reichelt@jreichelt·21h

Wir sind in die Hände von Verrückten geraten.

Frédéric Schwilden@totalreporter

In der Zukunft wird jeder Bundesminister für 15 Minuten Darsteller in einer Scripted-Reality-Show sein.

Deutsch

229

483

3.7K

73.9K

FlyingIkki retweetledi

Frank Thelen@frank_thelen·1d

Deutschland ist zu langsam für die KI-Welt und das müssen wir ändern 🇩🇪 Mit BILD habe ich beim Money Mittwoch darüber gesprochen, wie wir in Europa aktuell mit dem Thema KI umgehen, wo wir gerade die größten Fehler machen 🚨 und was sich ändern muss, damit wir langfristig wieder vorne mitspielen können! Spoiler: Es geht nicht um fehlendes Talent oder fehlende Ideen, sondern um die Umsetzung. Es geht um Tempo und Mut zu echter Veränderung. Während andere Länder Milliarden in KI-Infrastruktur pumpen und Regulierung als Standortvorteil begreifen, diskutieren wir in Deutschland noch über Datenschutz-Bedenken, lange bevor das erste Produkt überhaupt live ist 🐌 Und während Talente aus Deutschland nach London oder ins Valley abwandern, weil dort Kapital und Geschwindigkeit aufeinander treffen, fehlt uns hier oft schon der Mut für die erste Finanzierungsrunde.

Deutsch

146

14.6K

FlyingIkki retweetledi

darkzodchi@zodchiii·1d

Google CEO, Sundar Pichai: "If you don't teach your agents to debug themselves now, you will keep wasting hours every week." In 30 minutes he explains why the engineers pulling ahead let agents fix their own failures instead of doing it themselves. Watch the talk, then save the exact setup below👇

darkzodchi@zodchiii

x.com/i/article/2067…

English

527

136.6K

FlyingIkki retweetledi

ÖRR Blog.@OERRBlog·23h

Die BBC berichtet ausführlich, dass das ZDF eine Unterlassungserklärung nach der Falschberichterstattung über Elon Musk abgegeben hat. Auf der ZDF heute Seite findet sich nur eine Kurzmeldung im Bereich Korrekturen. #OerrBlog

Deutsch

275

22.8K

FlyingIkki retweetledi

Alex Finn@AlexFinn·22h

I was wrong I've been saying for months that open source AI models are 6 months behind frontier They caught up. GLM 5.2 is as good as Opus 4.8 This changes everything. If you run GLM 5.2 locally no government can take it away. You become sovereign And even if you run through APIs, its a fraction of the cost The battlefield is different now. If open source is as good as frontier, and people have cheaper alternatives, governments can't be as quick to regulate. It will destroy the frontier AI labs All of this is such a massive win for the people If you are not paying attention to local models yet, you are making a tremendous mistake

Z.ai@Zai_org

English

342

441

4.6K

745.5K

FlyingIkki retweetledi

Benedikt Brechtken@ben_brechtken·1d

Exklusiv: Apollo News war dabei, als in einer Erfurter Kirche die Blockade des AfD-Parteitages geplant wurde. Eine Blockade, die sich für geltende Gesetze nicht interessiert: In unmissverständlichen Worten erklärte die Aktivistin am Podium das Selbstverständnis der Organisation: „Die Idee ist schon (…) ein bewusster Regelübertritt, also wir überschreiten sozusagen Gesetze, die gerade gelten, weil wir es für legitim erachten!“ Sie fuhr fort: „Weil wir es für legitim halten, übertreten wir die Regeln und stellen uns ihnen [der AfD] in den Weg, auch wenn die Polizei sagt, dass wir wegbleiben sollen!“ Bemerkenswert ist dabei auch, wie offen die Antifa-Gruppe mit diesem Vorgehen umgeht. Transparenz beim Rechtsbruch wird offenbar als Tugend verstanden: „Wir sagen, was wir machen. Wir kündigen das an. Wir haben es überall gesagt und wir halten uns einfach an das, was wir sagen!“ apollo-news.net/widersetzen-gr…

Deutsch

157

994

3.1K

46.2K

FlyingIkki retweetledi

Elon Musk@elonmusk·1d

Just follow SpaceX if you want news about our company

Muskonomy@muskonomy

NEWS: SpaceX will break its own news on X, not on newswires, an SEC filing shows The company named its X account @SpaceX as an official disclosure channel. Its investor page at ir.spacex.com is the other. SpaceX says it will skip wire services like Business Wire and PR Newswire. It tells investors to follow @SpaceX for material updates.

English

4.5K

10.7K

95K

24M

FlyingIkki retweetledi

Elon Musk@elonmusk·1d

Grok on Bedrock

Matt Garman@mattsgarman

Excited to share that Grok 4.3 from @elonmusk's @xai is now available on Amazon Bedrock. Customers pick the right model for the right job, and we keep making that easier: aws.amazon.com/about-aws/what…

English

1.5K

3.2K

20.2K

3.1M

FlyingIkki retweetledi

Markus Haintz@Haintz_MediaLaw·2d

Passend zum Thema Debanking versucht eine große deutsche Sparkasse, mir ein Privatkonto zu kündigen. Die Kündigung habe ich mangels Vollmacht zurückgewiesen, damit ist sie unwirksam. Mal sehen, ob die Bank es erneut versucht. Falls ja, werden wir das gerne öffentlich und gerichtlich ausdiskutieren. Ein "sachgerechter Grund" für die Kündigung liegt nicht vor, wäre aber Voraussetzung. Für den Moment gebe ich der Bank die Gelegenheit zum gesichtswahrenden Rückzug. Interessant ist auch, dass eine Abteilung der Bank mir vorher mitgeteilt hat, dass mit dem Konto und der Nutzung alles in Ordnung sei. Eine andere Abteilung hat praktisch zeitgleich die Kündigung ausgesprochen.

Markus Haintz@Haintz_MediaLaw

Debanking in Deutschland: Wenn die "falsche" Meinung zum Problem wird und wie man sich dagegen wehrt – HAINTZmedia auf YouTube: youtube.com/watch?v=85Bzec… Janine Beicht im Gespräch mit Rechtsanwältin Melina Schwendenmann von Haintz legal zum Thema Debanking. Unsere mediale Arbeit unterstützen, siehe Link im ersten Kommentar.

Deutsch

324

1.9K

8.2K

302.1K

Keşfet

@okx @XLayerOfficial @no_stp_on_snek @elonmusk @BarackObama @taylorswift13 @cristiano @BillGates