pgvector

Reasons Against pgvector: Technical Challenges at Scale

Honghao Wang

04 Nov 2025 — 1 min read

The Case Against pgvector — Key Takeaways

Read the full article: The Case Against pgvector

While the title might sound provocative, Alex Jacobs’ write‑up provides excellent technical insights.

It explores challenges encountered when operating pgvector — a PostgreSQL vector indexing extension — at scale, especially:

Maintaining large indexes with near real‑time updates.
Limitations and tradeoffs of IVFFlat and HNSW index types.

---

Pre‑ vs. Post‑Filtering: Why It Matters

One standout section is the discussion on query filtering order:

> Imagine a document search system with millions of vectors. Each document has metadata like status: draft, published, archived.

> A user searches — you only want published documents.

> Should Postgres filter first (pre‑filter) or search first, then filter (post‑filter)?

> This is not a trivial decision — it can mean queries take 50 ms vs 5 seconds, and dramatically change the relevance of results.

---

Hacker News Insights — Discourse’s Approach

The Hacker News thread includes valuable production context from Rafael dos Santos Silva (xfalcox), a Discourse developer.

> We run pgvector in production across thousands of databases, serving billions of page views.

> We make extensive use of quantization to control storage cost and performance.

Quantization in Practice

Storage: `halfvec` (16‑bit float)
Indexes: `bit` (binary vectors)

This enables Discourse to deploy embeddings efficiently across all hosted environments.

---

Embedding‑Powered Features at Discourse

Embeddings drive multiple features:

Related Topics — Suggests next reads via vector similarity.
Tag & Category Suggestions — While composing new topics.
Augmented Search — More relevant, richer search results.
RAG (Retrieval‑Augmented Generation) — Enhances retrieval from uploaded files.

---

Relevance Beyond PostgreSQL

Scaling challenges discussed here apply broadly to AI‑driven search, recommendations, and content indexing.

Creators in these fields now often combine:

AI content generation
Cross‑platform publishing
Analytics for optimization

---

Example: AiToEarn’s Unified Workflow

Platforms like AiToEarn官网 offer open‑source tools that:

Generate and publish content across channels — Douyin, WeChat, YouTube, X (Twitter), etc.
Provide analytics and model performance rankings (AI模型排名).
Streamline the link between creation, distribution, and monetization.

Such solutions are useful when balancing performance considerations (like pgvector indexing efficiency) with broad content reach.

---

In short: Whether you’re building scalable vector search in PostgreSQL, or architecting AI‑driven content workflows, these lessons in quantization, filtering strategy, and efficient retrieval are worth internalizing.

These College Students Are Helping OPPO Build AI Products

The Child Who Drew Worlds — Turning Doodles into AI Magic No one truly understands that child who talks to the paper while drawing. My little cousin always loves to speak to himself while painting — as if conversing with a friend from another world. "He’s not just drawing pictures

Ilya’s Shocking Testimony: Altman’s Wrongdoing, Mira’s Drama, and OpenAI’s Near-Merger with Anthropic

Silicon Valley Drama — Altman, Musk, and Ilya The saga of Sam Altman, Elon Musk, and Ilya Sutskever continues — Silicon Valley’s very own _“Empresses in the Palace”_, a tangled mix of ambition, rivalry, and philosophical clashes. Altman Breaks Silence on His OpenAI Equity Altman recently made a rare public comment

Elimination Game’s New Innovative Gameplay Hits $1M Monthly Revenue in 70 Days

A Rare Example of Light Gameplay Innovation Cover image generated by AI Casual elimination-based gameplay is one of the most highly monetizable genres in mobile gaming. This applies not only to traditional match-3 games, but also to variations using elimination mechanics—such as screw-tightening or rope-untying. While surface mechanics vary,

One Brief Awakens Eight AI Agents: A New Era for Advertisers | 43 Talks

AI Reshaping the Marketing Industry The wave of AI is driving a deep structural transformation in marketing. At the 43 Talks 5th Speculative Salon — AI Marketing Session, co-hosted by 43 College, Yilun AI, and Origin Academy, the first guest speaker — David Chu, CEO of Zhiwei Jianxing and founder of the