Optimize Glean search relevance and indexing throughput with batch sizing, datasource configuration, and content quality improvements. Trigger: "glean performance", "glean search quality", "glean indexing speed".
Glean's enterprise search API handles search queries across multiple connectors, bulk document indexing, and connector sync throughput. Search latency compounds when querying across dozens of datasources simultaneously. Large indexing jobs (10K+ documents) require careful batching to avoid rate limits and maintain connector sync schedules. Optimizing batch sizes, caching frequent search results, and tuning connector configurations reduces search P95 latency and keeps indexing pipelines within SLA windows.
const cache = new Map<string, { data: unknown; expiry: number }>();
const TTL = { search: 60_000, suggestions: 30_000, datasources: 600_000 };

async function cached<T>(key: string, ttlKey: keyof typeof TTL, fn: () => Promise<T>): Promise<T> {
  const entry = cache.get(key);
  if (entry && entry.expiry > Date.now()) return entry.data as T;
  const data = await fn();
  cache.set(key, { data, expiry: Date.now() + TTL[ttlKey] });
  return data;
}
// Search results expire fast (1 min). Datasource metadata is stable (10 min).
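Hit rates depend on stable keys: the same logical query serialized two different ways should map to one cache entry. A minimal key builder is sketched below; the parameter names are illustrative, not part of any Glean SDK.

```typescript
// Build a deterministic cache key from query parameters so that equivalent
// requests (e.g. datasource lists in different orders) share one cache entry.
function buildSearchCacheKey(query: string, datasources: string[] = [], pageSize = 25): string {
  const ds = [...datasources].sort().join(',');
  return `search:${query.trim().toLowerCase()}:${ds}:${pageSize}`;
}
```

Used as `cached(buildSearchCacheKey(q, ds), 'search', () => ...)`, this keeps re-ordered or re-whitespaced variants of the same query from fragmenting the cache.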
import PQueue from 'p-queue';

const BATCH_SIZE = 100;

async function indexDocsBatched(glean: any, dsName: string, docs: any[]) {
  const batches: any[][] = [];
  for (let i = 0; i < docs.length; i += BATCH_SIZE) batches.push(docs.slice(i, i + BATCH_SIZE));
  // Note: in p-queue, `interval` is a no-op without `intervalCap`; together they
  // cap how many batches may start per 500ms window.
  const queue = new PQueue({ concurrency: 3, intervalCap: 3, interval: 500 });
  await Promise.all(batches.map(batch =>
    queue.add(() => glean.indexDocuments(dsName, batch))
  ));
}
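The slicing loop above can be pulled out and sanity-checked in isolation; this is a standalone restatement of the same chunking logic, not an addition to the pipeline.

```typescript
// Split an array into fixed-size chunks; the final chunk may be smaller.
function chunk<T>(items: T[], size: number): T[][] {
  const out: T[][] = [];
  for (let i = 0; i < items.length; i += size) out.push(items.slice(i, i + size));
  return out;
}
```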
import { Agent } from 'https';
const agent = new Agent({ keepAlive: true, maxSockets: 15, maxFreeSockets: 5, timeout: 30_000 });
// High socket count for parallel indexing across multiple datasources
async function withGleanRateLimit<T>(fn: () => Promise<T>): Promise<T> {
  try { return await fn(); }
  catch (err: any) {
    if (err.status === 429) {
      // Honor Retry-After when present; fall back to 5s. Retries exactly once.
      const retryMs = parseInt(err.headers?.['retry-after'] || '5', 10) * 1000;
      await new Promise(r => setTimeout(r, retryMs));
      return fn();
    }
    throw err;
  }
}
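The wrapper above retries once and gives up. For sustained bulk indexing, a bounded multi-retry variant with exponential fallback is a reasonable extension; this sketch assumes the same error shape (`status`, `headers['retry-after']`) as the handler above.

```typescript
// Retry a call up to `maxRetries` times on HTTP 429, honoring Retry-After
// when the server supplies it and backing off exponentially otherwise.
async function withRetry<T>(fn: () => Promise<T>, maxRetries = 3): Promise<T> {
  for (let attempt = 0; ; attempt++) {
    try { return await fn(); }
    catch (err: any) {
      if (err.status !== 429 || attempt >= maxRetries) throw err;
      const headerMs = parseInt(err.headers?.['retry-after'] ?? '', 10) * 1000;
      const backoffMs = Number.isNaN(headerMs) ? 500 * 2 ** attempt : headerMs;
      await new Promise(r => setTimeout(r, backoffMs));
    }
  }
}
```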
const metrics = { searches: 0, indexOps: 0, cacheHits: 0, p95LatencyMs: 0, errors: 0 };
const latencies: number[] = [];

// `fromCache` avoids shadowing the cached() helper above. The latencies array
// grows unbounded -- cap or window it in long-running processes.
function trackSearch(startMs: number, fromCache: boolean) {
  const lat = Date.now() - startMs;
  latencies.push(lat);
  metrics.searches++;
  if (fromCache) metrics.cacheHits++;
  latencies.sort((a, b) => a - b);
  metrics.p95LatencyMs = latencies[Math.floor(latencies.length * 0.95)] || 0;
}
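The percentile index math can be verified against a known distribution; this is the same logic as in trackSearch, restated as a pure function for checking.

```typescript
// Sort ascending and take the value at floor(n * 0.95). With 100 samples
// this selects the 96th-smallest latency; an empty sample set yields 0.
function p95(samples: number[]): number {
  const sorted = [...samples].sort((a, b) => a - b);
  return sorted[Math.floor(sorted.length * 0.95)] ?? 0;
}
```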
| Issue | Cause | Fix |
|---|---|---|
| Slow cross-datasource search | Too many connectors queried in parallel | Prioritize datasources, set query scope |
| 429 on bulk indexing | Batch size or concurrency too high | Reduce to 100/batch, 3 concurrent, 500ms interval |
| Stale search results | Index lag after document updates | Use incremental indexing with webhooks on change |
| Connector sync timeout | Large datasource with no checkpointing | Enable incremental sync with cursor tracking |
| Missing documents in results | Incomplete metadata during indexing | Include title, body, author, and updated_at fields |
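For the cross-datasource row above, narrowing query scope can be sketched as a request-body builder. The `requestOptions.datasourcesFilter` field reflects my understanding of Glean's search request shape; verify the exact field names against the current Client API reference before relying on this.

```typescript
// Build a search request scoped to a priority subset of datasources instead of
// fanning out to every connector. Field names (requestOptions.datasourcesFilter)
// are assumed from Glean's /rest/api/v1/search request shape -- confirm in docs.
function buildScopedSearchRequest(query: string, priorityDatasources: string[], pageSize = 25) {
  return {
    query,
    pageSize,
    requestOptions: { datasourcesFilter: priorityDatasources },
  };
}
```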
See glean-reference-architecture.