Back to glossary

Infrastructure & DevOps

Database Indexing

The creation of data structures that speed up data retrieval operations by providing efficient lookup paths to rows matching specific query conditions. Indexes trade additional storage space and slower write performance for dramatically faster read queries.

A database index is conceptually similar to a book's index: instead of scanning every page to find a topic, you look up the topic in the index and jump directly to the relevant pages. Common index types include B-tree indexes for range queries, hash indexes for exact lookups, GIN indexes for full-text search, and specialized indexes for geometric, JSON, or array data.

For AI product teams, proper indexing is fundamental because AI features often query large datasets under tight latency constraints. A recommendation service that looks up user interaction history, a search feature that filters candidates by multiple attributes, or an analytics query that aggregates events across time ranges all depend on well-designed indexes. Growth teams should ensure that the queries powering experiment analysis, cohort segmentation, and funnel metrics have appropriate indexes, since slow analytics queries delay experiment conclusions and decision-making. Vector indexes, specifically designed for similarity search in high-dimensional spaces, are increasingly important for AI applications that use embeddings for semantic search, recommendation, and content matching.

Related Terms

Content Delivery Network

A geographically distributed network of proxy servers that caches and delivers content from locations closest to end users. CDNs reduce latency, improve load times, and absorb traffic spikes by serving content from edge nodes rather than a single origin server.

Edge Computing

A distributed computing paradigm that processes data closer to the source of generation rather than in a centralized data center. Edge computing reduces latency, conserves bandwidth, and enables real-time processing for latency-sensitive applications.

Serverless Computing

A cloud execution model where the provider dynamically manages server allocation and scaling. Developers deploy functions or containers without provisioning infrastructure, paying only for actual compute time consumed rather than reserved capacity.

Function as a Service

A serverless computing category where developers deploy individual functions that execute in response to events. FaaS platforms like AWS Lambda, Google Cloud Functions, and Azure Functions handle all infrastructure management, scaling each function independently.

Platform as a Service

A cloud computing model that provides a complete development and deployment environment without managing underlying infrastructure. PaaS offerings like Heroku, Vercel, and Google App Engine handle servers, storage, networking, and runtime configuration.

Infrastructure as a Service

A cloud computing model that provides virtualized computing resources over the internet. IaaS offerings like AWS EC2, Google Compute Engine, and Azure Virtual Machines give teams full control over servers, storage, and networking without owning physical hardware.