This page provides an overview for optimizing statement performance in CockroachDB. To get good performance, you need to look at how you're accessing the database through several lenses:
- SQL statement performance: This is the most common cause of performance problems and where you should start.
- Schema design: Depending on your SQL schema and the data access patterns of your workload, you may need to make changes to avoid creating transaction contention or hot spots.
- Cluster topology: As a distributed system, CockroachDB requires you to trade off latency vs. resiliency. This requires choosing the right cluster topology for your needs.
SQL statement performance rules
To get good SQL statement performance, follow these rules:
- Rule 1. Scan as few rows as possible. If your application is scanning more rows than necessary for a given statement, it's going to be difficult to scale.
- Rule 2. Use the right index. Your statement should use an index on the columns in the WHEREclause. You want to avoid the performance hit of a full table scan.
- Rule 3. Use the right join type. Depending on the relative sizes of the tables you are querying, the type of join may be important. You should only rarely need to specify the join type because the cost-based optimizer should pick the best-performing join type if you add the right indexes as described in Rule 2. However, in some circumstances you may want to specify a join hint.
These rules apply to an environment where thousands of OLTP statements are being run per second, and each statement needs to run in milliseconds. These rules are not intended to apply to analytical, or OLAP, statements.
For an example of applying the rules to a query, see Apply SQL Statement Performance Rules.
Schema design
If you are following the instructions in the SQL performance section and still not getting the performance you want, you may need to look at your schema design and data access patterns to make sure that you are not:
- Introducing transaction contention. For methods for diagnosing and mitigating transaction contention, see Transaction contention.
- Creating hot spots in your cluster. For methods for detecting and eliminating hot spots, see hot spots.
Cluster topology
It's very important to make sure that the cluster topology you are using is the right one for your use case. Because CockroachDB is a distributed system that involves nodes communicating over the network, you need to choose the cluster topology that results in the right latency vs. resiliency tradeoff.
For more information about how to choose the cluster topology that is right for your application, see Topology Patterns Overview.