

binarymax on Oct 29, 2024, on: Vector databases are the wrong abstraction

It’s impossible to answer that question without knowing what content/query domain you are embedding. Check out the MTEB leaderboard, dig into the retrieval benchmark, and look for datasets analogous to yours.

3abiton on Oct 29, 2024

So we're talking about picking the best embedding model per use case? Medical data would require a different model than, say, sales data? Sounds like a very fragmented approach.

ekianjo on Oct 30, 2024

The answer lies with a validation dataset that you create for testing.
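
One rough way to picture such a validation dataset: a handful of queries, each labeled with the document it should retrieve, scored with recall@k for every candidate model. This is only a minimal sketch under stated assumptions. `toy_embed` is a hypothetical bag-of-words stand-in, not a real embedding model, and the documents, queries, and the `recall_at_k` helper are made up for illustration.

```python
import math
from collections import Counter
from typing import Callable, Dict

# Assumption: an "embedding" here is a sparse word -> weight mapping so the
# sketch runs with the standard library only. A real model would return a
# dense vector, and cosine() would need to change accordingly.
Vector = Dict[str, float]


def toy_embed(text: str) -> Vector:
    """Hypothetical stand-in embedder: plain bag-of-words counts."""
    return {word: float(n) for word, n in Counter(text.lower().split()).items()}


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity between two sparse vectors (0.0 if either is empty)."""
    dot = sum(weight * b.get(word, 0.0) for word, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def recall_at_k(
    embed: Callable[[str], Vector],
    docs: Dict[str, str],
    labeled_queries: Dict[str, str],
    k: int = 1,
) -> float:
    """Fraction of queries whose labeled document appears in the top k results."""
    doc_vecs = {doc_id: embed(text) for doc_id, text in docs.items()}
    hits = 0
    for query, relevant_id in labeled_queries.items():
        query_vec = embed(query)
        ranked = sorted(
            doc_vecs, key=lambda d: cosine(query_vec, doc_vecs[d]), reverse=True
        )
        hits += relevant_id in ranked[:k]
    return hits / len(labeled_queries)


# Made-up corpus and validation set: query -> id of the document it should find.
docs = {
    "med": "patient presented with acute chest pain and elevated troponin",
    "sales": "quarterly sales revenue grew in the enterprise segment",
}
validation = {
    "chest pain with elevated troponin": "med",
    "enterprise sales revenue": "sales",
}

score = recall_at_k(toy_embed, docs, validation, k=1)
print(f"recall@1 = {score:.2f}")
```

To compare models, you would pass each candidate's embedding function in place of `toy_embed` and keep the one that scores best on queries drawn from your own domain.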
