Why vector databases are having a moment as the AI hype cycle peaks

Vector databases are all the rage, judging by the number of startups entering the space and the investors ponying up for a piece of the pie. The proliferation of large language models (LLMs) and the generative AI (GenAI) movement have created fertile ground for vector database technologies to flourish.

While traditional relational databases such as Postgres or MySQL are well-suited to structured data — predefined data types that can be filed neatly in rows and columns — this doesn’t work so well for unstructured data such as images, videos, emails, social media posts, and any data that doesn’t adhere to a predefined data model.

Vector databases, on the other hand, store and process data in the form of vector embeddings, which convert text, documents, images, and other data into numerical representations that capture the meaning and relationships between the different data points. This is perfect for machine learning, as the database stores data spatially by how relevant each item is to the other, making it easier to retrieve semantically similar data.

This is particularly useful for LLMs, such as OpenAI’s GPT-4, as it allows the AI chatbot to better understand the context of a conversation by analyzing previous similar conversations. Vector search is also useful for all manner of real-time applications, such as content recommendations in social networks or e-commerce apps, as it can look at what a user has searched for and retrieve similar items in a heartbeat. 

Vector search can also help reduce “hallucinations” in LLM applications, through providing additional information that might not have been available in the original training dataset.

“Without using vector similarity search, you can still develop AI/ML applications, but you would need to do more retraining and fine-tuning,” Andre Zayarni, CEO and co-founder of vector search startup Qdrant, explained to TechCrunch. “Vector databases come into play when there’s a large dataset, and you need a tool to work with vector embeddings in an efficient and convenient way.”

In January, Qdrant secured $28 million in funding to capitalize on growth that has led it to become one of the top 10 fastest growing commercial open source startups last year. And it’s far from the only vector database startup to raise cash of late — Vespa, Weaviate, Pinecone, and Chroma collectively raised $200 million last year for various vector offerings.

Qdrant founding team. Image Credits: Qdrant

Since the turn of the year, we’ve also seen Index Ventures lead a $9.5 million seed round into Superlinked, a platform that transforms complex data into vector embeddings. And a few weeks back, Y Combinator (YC) unveiled its Winter ’24 cohort, which included Lantern, a startup that sells a hosted vector search engine for Postgres.

Elsewhere, Marqo raised a $4.4 million seed round late last year, swiftly followed by a $12.5 million Series A round in February. The Marqo platform provides a full gamut of vector tools out of the box, spanning vector generation, storage, and retrieval, allowing users to circumvent third-party tools from the likes of OpenAI or Hugging Face, and it offers everything via a single API.

Marqo co-founders Tom Hamer and Jesse N. Clark previously worked in engineering roles at Amazon, where they realized the “huge unmet need” for semantic, flexible searching across different modalities such as text and images. And that is when they jumped ship to form Marqo in 2021.

“Working with visual search and robotics at Amazon was when I really looked at vector search — I was thinking about new ways to do product discovery, and that very quickly converged on vector search,” Clark told TechCrunch. “In robotics, I was using multi-modal search to search through a lot of our images to identify if there were errant things like hoses and packages. This was otherwise going to be very challenging to solve.”

Marqo cofounders

Marqo co-founders Jesse Clark and Tom Hamer. Image Credits: Marqo

Enter the enterprise

While vector databases are having a moment amid the hullabaloo of ChatGPT and the GenAI movement, they’re not the panacea for every enterprise search scenario.

“Dedicated databases tend to be fully focused on specific use cases and hence can design their architecture for performance on the tasks needed, as well as user experience, compared to general-purpose databases, which need to fit it in the current design,” Peter Zaitsev, founder of database support and services company Percona, explained to TechCrunch.

While specialized databases might excel at one thing to the exclusion of others, this is why we’re starting to see database incumbents such as Elastic, Redis, OpenSearch, Cassandra, Oracle, and MongoDB adding vector database search smarts to the mix, as are cloud service providers like Microsoft’s Azure, Amazon’s AWS, and Cloudflare.

Zaitsev compares this latest trend to what happened with JSON more than a decade ago, when web apps became more prevalent and developers needed a language-independent data format that was easy for humans to read and write. In that case, a new database class emerged in the form of document databases such as MongoDB, while existing relational databases also introduced JSON support.

“I think the same is likely to happen with vector databases,” Zaitsev told TechCrunch. “Users who are building very complicated and large-scale AI applications will use dedicated vector search databases, while folks who need to build a bit of AI functionality for their existing application are more likely to use vector search functionality in the databases they use already.”

But Zayarni and his Qdrant colleagues are betting that native solutions built entirely around vectors will provide the “speed, memory safety, and scale” needed as vector data explodes, compared to the companies bolting vector search on as an afterthought.

“Their pitch is, ‘we can also do vector search, if needed,’” Zayarni said. “Our pitch is, ‘we do advanced vector search in the best way possible.’ It is all about specialization. We actually recommend starting with whatever database you already have in your tech stack. At some point, users will face limitations if vector search is a critical component of your solution.”

Source link

Visited 1 times, 1 visit(s) today

Related Article

X, A Bastion For Hate, Claims It Will Reduce Hate Content In The UK

Christopher Furlong/Getty Images X has committed to reducing “hate and terror content” in the UK, according to the regulator Ofcom, by speeding up its review process for offending content and “withhold access in the UK” to accounts which post “illegal terrorist content” and are determined to be “operated by

Fitbit Air vs Whoop Strap Comparison: Price, Features and AI

The Google Fitbit Air is very much the talk of the fitness tracking town right now, not only because it’s the first new Fitbit device that we’ve had in years, but it’s also one of the first big brands to go head-to-head with the established Whoop Strap (if you don’t count the Polar Loop and

India EV Sales Jump 62% As Global EV Market Tops 20 Million In 2025: ICCT

Latest research by the International Council on Clean Transportation (ICCT) showed a sharp rise in global sales of light-duty electric vehicles (EVs), which crossed 20 million units in 2025. The study noted that EVs accounted for nearly 25% of new light-duty vehicle (LDV) sales globally, up from around 19% in 2024 and 15% in 2023.

xAI Introduces Its Coding Agent Called Grok Build

xAI xAI has launched a coding agent of its own to serve as competitor to its rivals’ products, such as Anthropic’s Claude Code. It’s called Grok Build, and it’s still in its early beta version that’s initially only available to SuperGrok Heavy subscribers paying $300 per month for

OpenAI brings Codex to phones via ChatGPT app

ChatGPT-maker OpenAI introduced Codex desktop application in February this year. The company has now announced that its AI coding assistant, Codex is now available on mobile via the ChatGPT app. This will enable the developers to manage and approve coding tasks directly from their phones. The rollout, currently in preview for iOS and Android, expands

Spectrum Adds discovery+ Streaming App to Eligible TV Plans at No Additional Cost

Spectrum and Warner Bros. Discovery announced that the discovery+ streaming app is now included at no additional cost for customers with eligible Spectrum TV plans.  Now Spectrum TV customers can immediately begin streaming their favorite discovery+ hit shows, from 90 Day Fiancé to Gold Rush and Ghost Adventures. This builds on Spectrum’s Seamless

Comparing AT&T and Verizon Mobile Phone Plans

In the space of a few months, Verizon got a new CEO who lowered prices and AT&T revamped its entire postpaid phone plan lineup, then separately added a new top-tier plan. If you’re considering jumping to AT&T or Verizon for your phone service, or thinking about changing your existing plan, we’re here to compare their offerings. (Are you reading

Google confirms native, premium apps will be ready for Googlebook launch

Support our independent tech coverage. Chrome Unboxed is written by real people, for real people—not search algorithms. Join Chrome Unboxed Plus for just $2 a month to get an ad-free experience, access to our private Discord, and more. Learn more about membership here.START FREE TRIAL (MONTHLY)START FREE TRIAL (ANNUAL) When it comes to the success

Alienware’s First Affordable Gaming Laptop Is Arriving At The Perfect Time

Even though Alienware has been around for 30 years, the company hasn’t really made an affordable, entry-level gaming laptop. But that changes today with the succinctly named Alienware 15, and based on the rising price of seemingly every gadget, it couldn’t have come at a better time. Let’s start with the basics. The Alienware 15

Samsung Galaxy S24 Battery Explosion Reported

Summary created by Smart Answers AI In summary: Tech Advisor reports a Samsung Galaxy S24 allegedly exploded in a user’s hand in South Korea, causing minor burns and prompting a forensic investigation by Samsung. This incident recalls the 2016 Galaxy Note 7 recall crisis and raises concerns about ongoing battery safety issues despite Samsung’s eight-point

Modular EV Architecture Platforms Market Roadmap: Expected

Modular EV Architecture Platforms Market The Global Modular EV Architecture Platforms Market Study, a comprehensive analysis of the market that spans more than 143+ pages and describes the product and industry scope as well as the market prognosis and status for 2025-2032. The marketization process is being accelerated by the market study’s segmentation by important

12 Best Apps to Draw Tattoo Designs in 2026

In today’s digital age, tattoo apps have become indispensable tools for both aspiring and professional tattoo artists. They give tattoo artists the ability to create designs digitally, saving them time and allowing them to present professional designs to their clients. So, what app do tattoo artists use? which one is the best? Having the right

Cadillac hits 100,000 EV sales as Tesla drivers jump ship

Cadillac has hit a major milestone in the luxury EV race, reaching 100,000 electric vehicle sales just a few years after launching its first all-electric model. As Motor1 noted, GM said in January 2019 that Cadillac would spearhead its EV effort. That strategy began to take shape with the 2023 Lyriq, which was Cadillac’s first

Dangerous Deepfakes and AI Therapy: What Parents Need to Know

On this episode of Generation AI, host Derek Staahl digs into two AI trends that are hitting close to home for families right now. First: “nudify” apps and dangerous deepfakes. These tools can take an ordinary photo and turn it into an explicit AI-generated image—and a new investigation shows students around the world are being

0
Would love your thoughts, please comment.x
()
x