Travis Biehn

130 posts

@tbiehn

Travis Biehn is lost in a single pane of glass fun-house.

Mobile · Joined April 2011

0 Following · 127 Followers

Pinned Tweet

Travis Biehn@tbiehn·Feb 7

#Bitcoin in #Ruby with JavaScript in IE in VisualBasic with Wine on #Linux with Bochs on Google #Glass. #venturecapital DM me.

Travis Biehn@tbiehn·Jan 28

@gavin_gee @carrigmat Nebius looks like <24$ / hr for 8xh100 - 640gb of graphics ram 🤔 modestly or selective quant on that and you’re cooking with gas.

GG@gavin_gee·Jan 28

@carrigmat Would a x2idn.32xlarge on AWS work too? 128vcpu and 2048 gb RAM. It’s $13 an hour.

Matthew Carrigan@carrigmat·Jan 28

Complete hardware + software setup for running Deepseek-R1 locally. The actual model, no distillations, and Q8 quantization for full quality. Total cost, $6,000. All download and part links below:

Travis Biehn@tbiehn·Jan 21

@positiveblue2 @simonw R1 has tool calling tokens in its vocab & template. Ollama doesn’t have a generic template replacer for DeepSeek R1 yet, but the models likely support func calling. FIM tokens are in there too.

positiveblue ⚡️🍠@positiveblue2·Jan 20

@simonw Yes, that's what I did with Phi 4. Llama 3.2B interacts with the user but uses Phi 4 (with ctx about tools) for reasoning, and from the Phi4 prompt response it infers what tools to call. However, it would be nice to have it built in so you can run 3B/8B models on the edge.

Simon Willison@simonw·Jan 20

DeepSeek released a whole family of inference-scaling / "reasoning" models today, including distilled variants based on Llama and Qwen. Here are my notes on the new models, plus how I ran DeepSeek-R1-Distill-Llama-8B on my Mac using Ollama and LLM: simonwillison.net/2025/Jan/20/de…

Travis Biehn@tbiehn·Nov 5

So much ‘ad absurdum’ is now only compute bound. So, do we?

Travis Biehn@tbiehn·Aug 31

@SwannMarcus89 He said, chuckling, downing his sixth whiskey at the company Christmas party…

Swann Marcus@SwannMarcus89·Aug 30

I don't understand why people freak out about this picture given that it might be the single most easily avoidable death to ever happen to someone

≝ kennedy-sloane@beepboopsloane

sometimes i just look at this image and give myself a panic attack

Travis Biehn@tbiehn·Aug 16

@CynoPrime @CrackMeIfYouCan Interesting to see early use of ChatGPT in password cracking iterations, thanks for the write up!

CynoSure Prime@CynoPrime·Aug 16

We just posted our 2023 @CrackMeIfYouCan write-up. blog.cynosureprime.com/2023/08/korelo…

Travis Biehn@tbiehn·Aug 11

@Saboo_Shubham_ PassGAN is over 4 years old at this point - the original paper compared the technique with other SOTA approaches - PassGAN is one of the worst generative strategies available when compared to JTR & HashCat dict+rules. The password strength reality is worse with those tools.

Shubham Saboo@Saboo_Shubham_·Aug 11

Learn More: homesecurityheroes.com/ai-password-cr… Follow me @Saboo_Shubham_ to stay updated with the latest AI developments.

Shubham Saboo@Saboo_Shubham_·Aug 11

AI can crack 51% of passwords in less than 1 min 🤯 Meet PassGAN, a Generative Adversarial Network (GAN) that can autonomously learn the distribution of real passwords from actual password leaks. It can crack any kind of 7-character password in <6 mins, even if it contains symbols.

Travis Biehn@tbiehn·Aug 9

@jerryjliu0 I do this - for specific tasks I see that, for example, 1 of 4 strategies usually works best, however there's always a few cases where other ones produce the best answers.

Jerry Liu@jerryjliu0·Aug 8

The full guide and notebook are linked below. Check it out! Guide: gpt-index.readthedocs.io/en/stable/exam… Notebook: github.com/jerryjliu/llam…

Jerry Liu@jerryjliu0·Aug 8

There are too many options for building information retrieval:
- Chunk size
- Query strategy (top-k, hybrid, MMR)

Idea: What if we ensembled *all of the options* + let an LLM prune the pooled results? 👇
✅ More general retriever (though more 💰)
✅ Benchmark diff strategies

Travis Biehn@tbiehn·Aug 1

@omarsar0 I produced an example of this technique three months ago here: github.com/tbiehn/thought… seemed kinda obvious as a strategy.

elvis@omarsar0·Jul 31

Skeleton-of-Thought: LLMs can do parallel decoding. Interesting prompting strategy which first generates an answer skeleton and then performs parallel API calls to generate the content of each skeleton point. Reports quality improvements in addition to speed-up of up to 2.39x. Big deal given how costly in terms of latency some tasks are. This is a great paper to rethink the necessity of sequential decoding of current LLMs. arxiv.org/abs/2307.15337

Travis Biehn@tbiehn·Jul 23

@simonw I'm generating and retrieving embeddings using a few different strategies, then use GPT-4 to compare and rank those responses. Doing a 'vibe check' on those strategies is important, getting ranked evals gives you prelim data to work from.

Simon Willison@simonw·Jul 23

Search engineers have spent decades figuring out good ways to evaluate if their search relevance algorithms are returning good results or not, we need to be adapting similar strategies

Simon Willison@simonw·Jul 22

With respect to retrieval augmented generation for answering user questions, what's the current accepted best practice on how best to chunk up text for indexing in a vector database? (If this question makes no sense to you my post here might help simonwillison.net/2023/Jan/13/se… )

Travis Biehn@tbiehn·Jul 6

dualuse.io/blog/llm-power… Out today - find out how to use @OpenAI LLMs & @pinecone kNN retrieval to automate the creation of AppSec tool rules - seriously reducing the effort required for everyone to stay ahead of threats.

Travis Biehn@tbiehn·Jun 29

I've released a complementary tool to ThoughtLoom that lets you do k-nearest-neighbors embeddings search from your CLI. I've used it for all sorts of nonsense. github.com/tbiehn/embedme…

Travis Biehn@tbiehn·Jun 28

Getting formatted data out of your LLM is a PITA. Specify JSONSpec functions for the new OpenAI API function support in the latest version of ThoughtLoom. Tell the model to use one, and voila - structured, escaped, typed emissions. github.com/tbiehn/thought…

Travis Biehn@tbiehn·Jun 13

LLM powered program exploration will be another leap for dynamic application security testing - at least as big as concolic fuzzing. Here's some great work showing how LLMs can make their way through arbitrary workflows and arbitrary user-interfaces: osu-nlp-group.github.io/Mind2Web/

Travis Biehn@tbiehn·Jun 12

@itsandrewgao @OpenAI @GeneChaser Nice - here's a slightly different take on multithreading tasks to OpenAI / Azure hosted LLM that lets you define templates for jobs - github.com/tbiehn/thought… . There's some examples of banging different jobs together into a pipeline.

andrew gao@itsandrewgao·Jun 11

🧵🐇LightspeedGPT => 133 tokens/sec. 🐇 Multithreading + error handling for rate limits & @openai API overloads. Used it to translate a 9,000 word article into Spanish in just 90 seconds. Code: github.com/andrewgcodes/l… stay tuned for more from @GeneChaser ⛵ #AI #GPT4 👇👇

andrew gao@itsandrewgao

does openai have any plans to fix slow API speed?

Travis Biehn@tbiehn·Jun 12

Exceptional.

Aran Komatsuzaki@arankomatsuzaki

Mind2Web: A Benchmark for Language Grounding to Automate Tasks on the Web - Presents the first dataset for developing and evaluating generalist agents for the web - Over 2,000 tasks collected from 137 websites proj: osu-nlp-group.github.io/Mind2Web/ abs: arxiv.org/abs/2306.06070

Travis Biehn@tbiehn·Jun 12

@_atilla1 Good UX. I wonder how they'll make it not suck to use physically? Try mashing your fingers into a table, or wiggling them in the air for an extended period of time. Could definitely see us spending more time and doing less trivial work using a paired physical keyboard.

Atilla@_atilla1·Jun 11

Attention to details is crucial, especially when it comes to interactions. 👇🏼Here's a little breakdown of the keyboard interaction and visual feedback in visionOS. 1. Look at how the keys get highlighted when hovering with the fingers over them. ❤️ 2. Pressing a key pushes it downwards on the Z axis. 3. Additionally a little circular pulse expands outwards for visual confirmation. Just looking at this is so satisfying, but typing in Vision Pro must be another level of satisfaction.

Travis Biehn@tbiehn·Jun 8

@ka Yo! Glad to hear you dig it.

Travis Biehn@tbiehn·May 31

Large Language Models will disrupt how we do application security. Read my post to find out where, why, and how; dualuse.io/blog/llm-power…

Travis Biehn@tbiehn·May 19

Experimenting with LLMs? Love CLI? Always thought ideal CLI IPC was JSON? You’re one of the 5 people I wrote github.com/tbiehn/thought… for! Comes with a bunch of examples - including (cyber)security; writing reports from scan results, & generating fix patches from semgrep results.

Travis Biehn retweeted

Clint Gibler@clintgibler·Mar 29

🔮 Harnessing the Hive Mind: How Semgrep and @pdiscoveryio's Nuclei Are Shaping the Future of Security Engineering 🔥 overview of the benefits of open source security tooling, modern security engineering, and where things are headed dualuse.io/blog/harnessin…
