Pinned Tweet
Kartikey

@dikshatwt Solution:
Just make a to-do list (the bare minimum you think you can do) and tick off everything by end of day.

How to ruin yourself:
1. Stay on your phone all day.
2. Feel sad for no clear reason.
3. Stop eating well and ignore your studies.
4. Sleep super late and wake up in the afternoon.
5. Let sadness take over everything.
6. Always look at others' lives and feel yours isn't enough.
7. Keep blaming yourself for the past but never try to let it go.
8. Compare your progress with people who started years before you.
9. Get stuck imagining outcomes instead of creating them.
10. Keep waiting for motivation instead of building discipline.

@carlcamilleri @brankopetric00 While scaling, this becomes a bottleneck on CPUs, which is why libraries like FAISS ship both CPU and GPU versions. Vector indexing and search on a CPU is more precise than the same operation on a GPU, but CPUs are slower, and we can't just keep burning compute, hence the GPU.

@brankopetric00 "Convert search into numbers" and "turn text into numbers": how computationally expensive are these steps? Do they remain feasible for a fast-changing dataset compared to traditional Lucene-based lexical search?

Vector databases explained for people who just want to understand.
You have 10,000 product descriptions. User searches for "comfortable outdoor furniture."
Traditional database:
- Searches for exact word matches
- Finds products containing "comfortable" OR "outdoor" OR "furniture"
- Misses "cozy patio seating" even though it's the same thing
- Keyword matching is stupid
Vector database approach:
- Convert search into numbers representing meaning: [0.2, 0.8, 0.1, 0.9, ...]
- Convert every product description to similar numbers
- Find products with similar number patterns
- Returns "cozy patio seating" because the numbers are close
- Meaning matching is smart
How it works:
Step 1: Turn text into vectors (arrays of numbers)
- "comfortable chair" becomes [0.2, 0.7, 0.1, 0.4, ...]
- "cozy seat" becomes [0.3, 0.8, 0.2, 0.5, ...]
- Similar meanings = similar numbers
- Uses AI models like OpenAI embeddings
Step 2: Store vectors efficiently
- Traditional database: Stores text
- Vector database: Stores arrays of numbers per item
- Indexes them for fast similarity search
- Optimized for "find similar" not "find exact"
Step 3: Search by similarity
- User query: "outdoor furniture"
- Convert to vector: [0.3, 0.6, 0.2, 0.8, ...]
- Find closest vectors using math (cosine similarity)
- Returns items ranked by similarity score
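The three steps above can be sketched in plain Python. The vectors here are hand-picked toy values standing in for a real embedding model (actual embeddings have hundreds of dimensions), so only the ranking logic is real:

```python
import math

# Toy 4-dimensional "embeddings" -- a real model (e.g. an OpenAI
# embedding model) would produce hundreds of dimensions per text.
catalog = {
    "cozy patio seating":      [0.3, 0.8, 0.2, 0.5],
    "comfortable chair":       [0.2, 0.7, 0.1, 0.4],
    "stainless steel blender": [0.9, 0.1, 0.8, 0.1],
}

def cosine_similarity(a, b):
    """1.0 = same direction (same meaning), near 0 = unrelated."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

def search(query_vector, k=2):
    """Rank every stored item by similarity to the query vector."""
    ranked = sorted(
        catalog.items(),
        key=lambda item: cosine_similarity(query_vector, item[1]),
        reverse=True,
    )
    return [name for name, _ in ranked[:k]]

# Pretend this is the embedding of the query "outdoor furniture".
query = [0.3, 0.6, 0.2, 0.8]
print(search(query))  # the two furniture items rank above the blender
```

Note the search never compares words at all, only vectors, which is why "cozy patio seating" can match a query that shares none of its keywords.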
Use cases:
- Product search that understands intent
- Documentation search that finds relevant answers
- Recommendation engines
- Chatbots that find similar questions
- Anomaly detection
Popular vector databases:
- Pinecone: Managed, easy, expensive
- Weaviate: Open source, feature-rich
- Milvus: Fast, scalable, complex
- pgvector: Postgres extension, simple
- Qdrant: Fast, Rust-based
Controversial take: You don't need a vector database for most projects. Start with Postgres + pgvector extension.
Vector databases are great for scale. For under 1 million vectors, your regular database with a vector extension works fine.
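To ground the pgvector suggestion, here is a minimal schema sketch. Table and column names are illustrative; `<=>` is pgvector's cosine-distance operator, and the dimension is shrunk to 3 for readability:

```sql
-- Enable the extension and store one vector per product.
CREATE EXTENSION IF NOT EXISTS vector;

CREATE TABLE products (
    id          bigserial PRIMARY KEY,
    description text,
    embedding   vector(3)  -- real embeddings: hundreds of dimensions
);

-- Optional ANN index; plain sequential scans are fine below ~1M rows.
CREATE INDEX ON products USING hnsw (embedding vector_cosine_ops);

-- Nearest neighbours to a query embedding, by cosine distance.
SELECT description
FROM products
ORDER BY embedding <=> '[0.3, 0.6, 0.2]'
LIMIT 5;
```

This keeps vectors next to the rest of your relational data, so you can filter, join, and do similarity search in one query.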

@westoque @brankopetric00 Training the model isn't part of the vector database, imo. We generally use a pre-trained model. Not sure on this, though.

@brankopetric00 you forgot the training step. someone actually needs to associate word meanings that are close to each other, and that is no easy feat. what makes a better search is how precise the training data is. arguably this is where a lot of the competition is now.

@mohit__kulhari @brankopetric00 Maintaining the indexes, rebuilding them (in some cases) when inserting data, and preserving precision all become difficult as we scale to millions or billions of vectors. Though libraries like FAISS and JVector help a lot and are built to mitigate this issue.

@brankopetric00 what are the biggest challenges when scaling vector databases to millions of records?

@DataSpeeder @brankopetric00 Think of populating the database as inserting vectors into the DB.
A pre-trained model like OpenAI's embedding models or Llama-3, when given the words "cozy" or "comfortable", gives similar embeddings. In simple words, it gives vectors that are close to each other in space.

@brankopetric00 What is populating the vector database, and how does it determine that "cozy" and "comfortable" are close?

Networking online is powerful, but real connections happen in replies. Drop a hi and let’s connect. #Networking

If you could write a letter to your future self, what would you say? #SelfReflection

Would you rather build wealth fast or slow and steady? #MoneyTalk

Who’s the most inspiring person you follow here? #TwitterCommunity
