modeerf

14.9K posts


@modeerf

The real voyage of discovery consists not in seeking new landscapes, but in having new eyes.

Taipei City, Taiwan · Joined June 2008
3.9K Following · 505 Followers
modeerf retweeted
Sawyer Merritt
Sawyer Merritt@SawyerMerritt·
NEWS: xAI has just publicly posted the full 45 minute all-hands meeting that Elon Musk had with employees recently.
200 replies · 724 reposts · 6K likes · 2M views
modeerf retweeted
HAL2400 | AI Visual Creator
I used AI to make a gameplay video of a fictional game called 「千尋の冒険」 (Chihiro's Adventure). Being able to turn a world I'd want to play in into something real in 30 minutes really does feel like AI magic.
357 replies · 2.8K reposts · 36.4K likes · 6.4M views
modeerf
modeerf@modeerf·
The next frontier for AI innovation is transforming end-to-end workflows within vertical industries into step-by-step, iterable, and continuously optimizable `Agentic` processes.
0 replies · 0 reposts · 2 likes · 64 views
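A hedged sketch of what such an agentic decomposition could look like in code. All names here (`Step`, `AgenticPipeline`, the toy extract/draft steps) are hypothetical illustrations, not anything the tweet specifies; the point is only that each workflow step becomes an explicit, retryable, individually measurable unit.

```python
from dataclasses import dataclass, field
from typing import Callable

# Hypothetical sketch: a vertical-industry workflow decomposed into named,
# retryable steps that can be logged, iterated on, and swapped independently.

@dataclass
class Step:
    name: str
    run: Callable[[dict], dict]   # takes and returns the workflow state
    max_retries: int = 2

@dataclass
class AgenticPipeline:
    steps: list = field(default_factory=list)
    log: list = field(default_factory=list)  # (step, attempt, status) records

    def execute(self, state: dict) -> dict:
        for step in self.steps:
            for attempt in range(step.max_retries + 1):
                try:
                    state = step.run(state)
                    self.log.append((step.name, attempt, "ok"))
                    break
                except Exception:
                    self.log.append((step.name, attempt, "retry"))
        return state

# Toy steps: extract the first claim from a document, then draft a summary.
pipeline = AgenticPipeline(steps=[
    Step("extract", lambda s: {**s, "claim": s["doc"].split(".")[0]}),
    Step("draft",   lambda s: {**s, "summary": f"Summary: {s['claim']}"}),
])
result = pipeline.execute({"doc": "Margins fell 3%. Other text."})
```

Because every step is named and logged, the per-step records are exactly the hooks one would need to iterate on or optimize each stage in isolation.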
modeerf
modeerf@modeerf·
So, would it be fair to say that those active AI experts from China on Twitter and YouTube are intensely keeping up with new AI terminology like Context Engineering?
[image]
0 replies · 0 reposts · 0 likes · 57 views
modeerf
modeerf@modeerf·
I get the feeling that AI is making a once-impractical playbook suddenly viable: rapidly assembling something that resembles a newly funded startup (or repackaging existing product lines into one), then using it as a software marketing strategy to gain attention and collect GitHub stars.
0 replies · 0 reposts · 1 like · 52 views
modeerf
modeerf@modeerf·
In earlier days, posting on social media could reliably draw in readers, giving newsrooms a bit more flexibility with real-time content production. Now, with search taking the lead, content teams are under greater pressure to deliver faster and more efficiently.
0 replies · 0 reposts · 0 likes · 37 views
modeerf
modeerf@modeerf·
A leading editor noted a dramatic drop in traffic from Facebook, from one-fifth of referrals to just one-twentieth. As user attention shifted, search engines re-emerged as the dominant source of traffic.
1 reply · 0 reposts · 0 likes · 41 views
modeerf
modeerf@modeerf·
Last week, on day two of the "twelve consecutive days of live-demo product launches" that OpenAI seems intent on using to out-grind developers worldwide, I watched the Reinforced Fine-Tuning announcement and got curious what kind of black tech this is. So I went looking for the likely foundational paper, REFT: Reasoning with REinforced Fine-Tuning, and took some notes, which I'm sharing here: aichemy.cc/latest/openai-…
1 reply · 0 reposts · 0 likes · 94 views
modeerf retweeted
Ethan Mollick
Ethan Mollick@emollick·
"Hey Claude with computer use, watch this construction site video & write up things you see that are dangerous or good, create a spreadsheet of critical issues to address" (sped up). How firms use AI as manager, coach, or panopticon is going to have a big impact on what work becomes.
110 replies · 632 reposts · 5.7K likes · 830.9K views
modeerf
modeerf@modeerf·
Recently, our Japanese media crawler has frequently picked up the name Shunsaku Sagami. He's the 33-year-old founder of M&A Research Institute, a company specializing in using AI for corporate M&A. As of October 2023, his net worth had reached US$1.3 billion (≈ NT$42.3 billion).
1 reply · 0 reposts · 1 like · 108 views
modeerf retweeted
anton 🇺🇸
anton 🇺🇸@atroyn·
yesterday evening i gave a presentation to founders, investors, and the ai community at @aixventureshq on how to think about ai application development. it was well received so i'm going to reproduce it in full here on x the everything app (which is also now a slide deck app).
[image]
19 replies · 104 reposts · 738 likes · 160.2K views
modeerf retweeted
Andrej Karpathy
Andrej Karpathy@karpathy·
SQL injection-like attack on LLMs with special tokens

The decision by LLM tokenizers to parse special tokens in the input string (<|endoftext|>, etc.), while convenient looking, leads to footguns at best and LLM security vulnerabilities at worst, equivalent to SQL injection attacks.

!!! User input strings are untrusted data !!!

In SQL injection you can pwn bad code with e.g. the DROP TABLE attack. In LLMs we'll get the same issue, where bad code (very easy to mess up with current Tokenizer APIs and their defaults) will parse an input string's special token descriptors as actual special tokens, mess up the input representations and drive the LLM out of distribution of chat templates.

Example with the current huggingface Llama 3 tokenizer defaults. Two unintuitive things are happening at the same time:
1. The <|begin_of_text|> token (128000) was added to the front of the sequence.
2. The <|end_of_text|> token (128001) was parsed out of our string and the special token was inserted.

Our text (which could have come from a user) is now possibly messing with the token protocol and taking the LLM out of distribution with undefined outcomes.

I recommend always tokenizing with two additional flags, disabling (1) with add_special_tokens=False and (2) with split_special_tokens=True, and adding the special tokens yourself in code. Both of these options are, I think, a bit confusingly named. For the chat model, I think you can also use the Chat Templates apply_chat_template. With this we get something that looks more correct, and we see that <|end_of_text|> is now treated as any other string sequence and is broken up by the underlying BPE tokenizer as any other string would be.

TLDR: imo calls to encode/decode should never handle special tokens by parsing strings; I would deprecate this functionality entirely and forever. These should only be added explicitly and programmatically by separate code paths. In tiktoken, e.g., always use encode_ordinary. In huggingface, be safer with the flags above. At the very least, be aware of the issue, and always visualize your tokens and test your code.

I feel like this stuff is so subtle and poorly documented that I'd expect somewhere around 50% of the code out there to have bugs related to this issue right now. Even ChatGPT does something weird here: at best it just deletes the tokens, at worst this is confusing the LLM in an undefined way. I don't really know what happens under the hood, but ChatGPT can't repeat the string "<|endoftext|>" back to me.

Be careful out there.
[3 images]
152 replies · 440 reposts · 3.1K likes · 289.2K views
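The distinction Karpathy draws can be illustrated with a toy tokenizer. This is only a sketch (the real remedies are Hugging Face's add_special_tokens=False / split_special_tokens=True and tiktoken's encode_ordinary, as the tweet says); the `encode_unsafe` / `encode_ordinary` functions here are hypothetical stand-ins, with character codes standing in for BPE token ids.

```python
# Toy illustration of the footgun: parsing special-token strings out of
# untrusted user input vs. treating the whole string as plain text.

SPECIAL = {"<|begin_of_text|>": 128000, "<|end_of_text|>": 128001}

def encode_unsafe(text: str) -> list:
    """The bug: special-token strings in the input become real token ids."""
    ids, i = [], 0
    while i < len(text):
        for tok, tid in SPECIAL.items():
            if text.startswith(tok, i):
                ids.append(tid)      # user-controlled special token injected!
                i += len(tok)
                break
        else:
            ids.append(ord(text[i]))  # stand-in for ordinary BPE tokens
            i += 1
    return ids

def encode_ordinary(text: str) -> list:
    """The fix: never parse special tokens from strings (cf. tiktoken)."""
    return [ord(c) for c in text]

user_input = "hi<|end_of_text|>"            # untrusted data
unsafe_ids = encode_unsafe(user_input)      # smuggles token id 128001 in
# Special tokens are added explicitly, in code, never parsed from input:
safe_ids = [SPECIAL["<|begin_of_text|>"]] + encode_ordinary(user_input)
```

In the safe path the user's "<|end_of_text|>" survives only as ordinary text tokens, so it can no longer terminate the sequence or break the chat template.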
modeerf retweeted
Evan Vucci
Evan Vucci@evanvucci·
Republican presidential candidate former President Donald Trump raises his fist as he is rushed off stage after an assassination attempt during a campaign rally in Butler, Pa. @apnews
[image]
2.1K replies · 9.4K reposts · 62.7K likes · 6.7M views
modeerf retweeted
Elon Musk
Elon Musk@elonmusk·
Improved version of @Neuralink update
3.8K replies · 8.5K reposts · 66.8K likes · 19.9M views