Name: Mongodb Query Patterns
Author: 9tykeshav

Mongodb Query Patterns | Skills Pool

Request → Controller → Service A → Service B → Service C → Response
                          ↓            ↓            ↓
                        needs        needs        needs
                       doc X        doc X        doc X

Rule	How	Why
Project only needed fields	`.select('name email status')`	Full documents carry every field — list exactly what the caller needs, never widen "just in case"
Return plain objects for reads	`.lean()`	Skips Mongoose hydration — 2-5x faster for read-only paths
Ensure filter fields are indexed	`schema.index({ field: 1 })`	Without an index, MongoDB scans every document (COLLSCAN)
Batch related reads	`.find({ _id: { $in: ids } })` + Map	N round-trips collapse to 1
Batch related writes	`Model.bulkWrite([...])`	N sequential updates collapse to 1

// WRONG — each service queries the same document independently
async function checkout(orderId: string) {
  const order = await Order.findById(orderId).lean();
  await validateInventory(orderId);    // fetches order again
  await calculateTax(orderId);         // fetches order again
  await processPayment(orderId);       // fetches order again
}

// RIGHT — fetch once, pass through, always provide DB fallback
async function validateInventory(orderId: string, prefetched?: IOrder) {
  const order = prefetched
    || await Order.findById(orderId).select('items quantities').lean();
  // ...
}

async function checkout(orderId: string) {
  const order = await Order.findById(orderId).lean();
  await validateInventory(orderId, order);
  await calculateTax(orderId, order);
  await processPayment(orderId, order);
}

// WRONG — N+1: one query per item
for (const order of orders) {
  const product = await Product.findById(order.productId);
  // ...
}

// RIGHT — 1 query + O(1) lookups
const productIds = orders.map(o => o.productId);
const products = await Product.find({ _id: { $in: productIds } })
  .select('name price image')
  .lean();
const productMap = new Map(products.map(p => [p._id.toString(), p]));

for (const order of orders) {
  const product = productMap.get(order.productId.toString());
  // ...
}

// WRONG — identical query in both branches
if (userProvidedId) {
  const doc = await Order.findById(orderId).select('status').lean();
  // validate...
} else {
  const doc = await Order.findById(orderId).select('status').lean();
  // use directly...
}

// RIGHT — query once
const doc = await Order.findById(orderId).select('status').lean();
if (userProvidedId) { /* validate */ } else { /* use */ }

// WRONG — two round-trips to the same collection
const totalCount = await Booking.countDocuments({ eventId });
const perTypeCount = await Booking.aggregate([
  { $match: { eventId } },
  { $group: { _id: '$ticketType', count: { $sum: 1 } } },
]);

// RIGHT — one fetch, derive both
const bookings = await Booking.find({ eventId })
  .select('ticketType')
  .lean();
const totalCount = bookings.length;
const perTypeCount = new Map();
for (const b of bookings) {
  perTypeCount.set(b.ticketType, (perTypeCount.get(b.ticketType) || 0) + 1);
}

// WRONG — N sequential round-trips
for (const item of items) {
  await Item.findByIdAndUpdate(item._id, { $set: { archived: true } });
}

// RIGHT — 1 round-trip
await Item.bulkWrite(
  items.map(i => ({
    updateOne: {
      filter: { _id: i._id },
      update: { $set: { archived: true } },
    },
  }))
);

// WRONG — if orders has 50 items, this fires 50+ hidden queries
const orders = await Order.find({ userId })
  .populate('product')
  .populate('seller')
  .populate('reviews');

// RIGHT — batch manually
const orders = await Order.find({ userId }).select('product seller').lean();
const productIds = orders.map(o => o.product);
const sellerIds = orders.map(o => o.seller);

const [products, sellers] = await Promise.all([
  Product.find({ _id: { $in: productIds } }).select('name price').lean(),
  Seller.find({ _id: { $in: sellerIds } }).select('name rating').lean(),
]);
const productMap = new Map(products.map(p => [p._id.toString(), p]));
const sellerMap = new Map(sellers.map(s => [s._id.toString(), s]));

// WRONG — 2 round-trips, fetches entire document
const user = await User.findById(userId);
user.lastLogin = new Date();
await user.save();

// RIGHT — 1 round-trip, touches only the field
await User.updateOne(
  { _id: userId },
  { $set: { lastLogin: new Date() } }
);

// WRONG — sequential, each waits for the previous
const users = await User.find({ _id: { $in: userIds } }).select('name email').lean();
const responses = await FormResponse.find({ eventId }).select('userId answers').lean();
const reviews = await Review.find({ eventId }).select('userId rating').lean();

// RIGHT — concurrent, all fire at once
const [users, responses, reviews] = await Promise.all([
  User.find({ _id: { $in: userIds } }).select('name email').lean(),
  FormResponse.find({ eventId }).select('userId answers').lean(),
  Review.find({ eventId }).select('userId rating').lean(),
]);

schema.index({ order_id: 1 });                 // single field
schema.index({ userId: 1, status: 1 });         // compound — high-selectivity field first
schema.index({ event_id: 1, type: 1 });         // compound for filtered aggregations

[ ] Traced the request chain — no document is fetched more than once across the call stack
[ ] Services accept pre-fetched documents as optional params with DB fallback
[ ] Passing documents between functions, not just IDs
[ ] No query executes inside a loop — batched with $in where needed
[ ] No .populate() on a list of documents — batch manually with $in + Map
[ ] Every read query has .select() with only needed fields
[ ] Every read-only query has .lean()
[ ] Updates that don't need hooks use updateOne/bulkWrite, not findById + save
[ ] Filter fields have corresponding schema.index() definitions
[ ] Multiple writes use bulkWrite, not a loop
[ ] Independent queries run concurrently with Promise.all
[ ] Same query doesn't appear in multiple code branches — hoisted above
[ ] .select() lists exactly the fields needed — not widened "just in case"

Mistake	Why It Breaks
Passing IDs between services instead of documents	Forces every downstream service to re-query — pass the document
`.populate()` on a list of documents	Each populate fires a separate query per document — silent N+1
`findById` + modify + `.save()` for simple field update	2 round-trips + full document fetch — use `updateOne` directly
`.select()` without `.lean()`	Still returns Mongoose document with full change tracking overhead
`.lean()` then calling `.save()`	`.lean()` returns plain objects — no Mongoose methods available
Batch `$in` + `array.find()` for lookup	O(n²) — always use a Map for O(1) lookups
Prefetch param without DB fallback	Breaks webhook/cron callers that don't have the pre-fetched data
Sequential independent queries	Use `Promise.all` — 3 queries at 100ms each take 100ms, not 300ms
Widening `.select()` "just in case"	Fetch only what the caller actually uses — extra fields waste bandwidth
Unbounded `.find()` without limit	Risk of loading entire collection into memory
Index on rarely-queried field	Indexes slow every write for no read benefit

Mongodb Query Patterns

Overview

When to Use

Data Flow Principles

1. Trace the Request Chain First

Mongodb Query Patterns

Overview

When to Use

Data Flow Principles

1. Trace the Request Chain First

2. Fetch Wide at the Top, Narrow Below

3. Every Query Must Justify Its Existence

Query Efficiency Rules

Patterns

Pass Data Through Service Boundaries

Batch Instead of Loop

Hoist Common Queries

Consolidate Same-Collection Queries

Bulk Writes Over Loops

Avoid Hidden N+1 from `.populate()`

Update Directly — Don't Fetch to Modify

Run Independent Queries Concurrently

Index Every Filter Field

Checklist

Common Mistakes

Vector Index Tuning

Azure Resource Manager Redis Dotnet

Redis Expert

Elasticsearch

Cache Expert

Abp Mongodb

Mongodb Query Patterns

Overview

When to Use

Data Flow Principles

1. Trace the Request Chain First

Mongodb Query Patterns

Overview

When to Use

Data Flow Principles

1. Trace the Request Chain First

2. Fetch Wide at the Top, Narrow Below

3. Every Query Must Justify Its Existence

Query Efficiency Rules

Patterns

Pass Data Through Service Boundaries

Batch Instead of Loop

Hoist Common Queries

Consolidate Same-Collection Queries

Bulk Writes Over Loops

Avoid Hidden N+1 from .populate()

Update Directly — Don't Fetch to Modify

Run Independent Queries Concurrently

Index Every Filter Field

Checklist

Common Mistakes

Vector Index Tuning

Azure Resource Manager Redis Dotnet

Redis Expert

Elasticsearch

Cache Expert

Abp Mongodb

Avoid Hidden N+1 from `.populate()`