PK stands for Primary Key. In Salesforce, every object's record ID field is its primary key, and that field is always indexed, so queries that filter on it are fast. A chunk, in database terms, is a small subset of a table's records. PK chunking, then, splits a large query into manageable chunks based on record IDs, which improves bulk query performance and reliability.
When to Use PK Chunking?
We should enable PK chunking when querying tables with more than 10 million records, or when a bulk query consistently times out. PK chunking is a supported feature of the Salesforce Bulk API, so the API does all the work of splitting the query into manageable chunks.
How to enable PK Chunking?
To enable automatic PK chunking for a bulk query job, we use the PK chunking request header (Sforce-Enable-PKChunking). PK chunking splits bulk queries on large tables into chunks based on the record IDs, or primary keys, of the queried records. Each chunk is processed as a separate batch that counts toward our daily batch limit, and we must download each batch's results separately. PK chunking works only with queries that don't include subqueries or conditions other than WHERE.
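As a sketch, the header is simply attached when the Bulk API job is created. The `build_bulk_headers` helper below is hypothetical (not part of any Salesforce library), but the header names themselves (`X-SFDC-Session`, `Sforce-Enable-PKChunking`) are the documented Bulk API request headers:

```python
# Sketch: assembling request headers for a Bulk API query job with
# PK chunking enabled. build_bulk_headers is a hypothetical helper.

def build_bulk_headers(session_id, chunk_size=None, start_row=None):
    headers = {
        "X-SFDC-Session": session_id,
        "Content-Type": "application/xml; charset=UTF-8",
    }
    parts = []
    if chunk_size is not None:
        parts.append(f"chunkSize={chunk_size}")
    if start_row is not None:
        parts.append(f"startRow={start_row}")
    if parts:
        headers["Sforce-Enable-PKChunking"] = "; ".join(parts)
    else:
        # Setting the header to TRUE enables PK chunking with the
        # default chunk size of 100,000.
        headers["Sforce-Enable-PKChunking"] = "TRUE"
    return headers

# session_id here is a placeholder, not a real token
print(build_bulk_headers("sessionId", chunk_size=25000))
```

These headers would then be sent with the POST request that creates the bulk job.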
How Does PK Chunking Work?
In general terms, we should first query the target table to identify a number of chunks of records with sequential IDs. Then we should submit separate queries to extract the data in each chunk and finally combine the results.
PK chunking works by adding record ID boundaries to the query with a WHERE clause, limiting the query results to a smaller chunk of the total results. The remaining results are fetched with extra queries that contain successive boundaries. The number of records within the ID boundaries of each chunk is referred to as the chunk size. The first query retrieves records between a specified starting ID and the starting ID plus the chunk size. The next query retrieves the next chunk of records, and so on.
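The successive bounded queries described above can be sketched as follows. Real Salesforce record IDs are alphanumeric and the Bulk API computes the boundaries internally; this illustration simplifies them to integers just to show the pattern of WHERE clauses:

```python
# Illustrative sketch of PK chunking: each chunk is the original query
# restricted to a successive ID range via a WHERE clause. Integer IDs
# are a simplification; the Bulk API does this on real record IDs.

def chunked_queries(soql, min_id, max_id, chunk_size):
    """Yield one ID-bounded query per chunk of the ID range."""
    start = min_id
    while start <= max_id:
        end = start + chunk_size
        yield f"{soql} WHERE Id >= {start} AND Id < {end}"
        start = end

for q in chunked_queries("SELECT Id, Name FROM Account", 0, 250_000, 100_000):
    print(q)
```

Each generated query corresponds to one batch, and the union of all chunk results equals the result of the original unbounded query.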
When a query is successfully chunked, the original batch’s status shows as NOT_PROCESSED. If the chunking fails, the original batch’s status shows as FAILED, but any chunked batches that were successfully queued during the chunking attempt are processed as normal.
The default chunk size is 100,000, and the maximum size is 250,000. The default starting ID is the first record in the table. However, we can specify a different starting ID to restart a job that failed between chunked batches.
Sforce-Enable-PKChunking: chunkSize=25000; startRow=00130000000xEftMGH
The PK chunking header above sets a chunk size of 25,000 and starts chunking from the record with ID 00130000000xEftMGH.