Historical API data has a lot of missing data

theniceguy
Hi,

I was pulling historical data from Zerodha API for almost all exchanges and all instruments.

I did an distribution analysis of the number of ticks(minute interval)
for 3 below exchanges, I found that as shown in below table, I could find less than 10 ticks data for the corresponding number of days

| exchange | name | days |
|:-----------|-------:|------------:|
| BCD | 3 | 74 |
| CDS | 4 | 98 |
| NFO | 196 | 3625 |

Also for all instruments, i checked the number of hours data available for each instrument for each day, and below is the data distribution, where I see a high number of days-instrument token combinations where i see 33964 with less than 1 hour of data.


| no of hours | n_recs |
|--------:|---------:|
| 8 | 992 |
| 7 | 1068 |
| 6 | 100734 |
| 5 | 37768 |
| 4 | 15702 |
| 3 | 11727 |
| 2 | 10330 |
| 1 | 11390 |
| 0 | 33964 |




can someone please explain why such data? are these discrepancies/missing data or should we expect this kind of missing data?
Sign In or Register to comment.