Strange data from Spain

The ECDC data set for Spain shows some strange patterns: on 2 days the daily death count is negative. One figure, recorded on 25th May, is a huge -1918; the other, from 12th Aug, is just -2, but it still should not be negative. I understand there may be situations where data need correction, but in a data set presenting a time series this should be done by correcting the data at the date where it was previously overstated. Otherwise the entire data collection process becomes doubtful. The figure below shows daily deaths data for Spain with potentially wrong entries marked by orange dots.

The table below lists the data points which need to be corrected:

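| Item | index | DateRep | GeoId | Cases | Deaths | Countries and territories |
|------|-------|------------|-------|-------|--------|---------------------------|
| 1 | 46429 | 2020-04-27 | ES | 1660 | 0 | Spain |
| 2 | 46428 | 2020-04-28 | ES | 1525 | 632 | Spain |
| 3 | 46404 | 2020-05-22 | ES | 1787 | 688 | Spain |
| 4 | 46401 | 2020-05-25 | ES | -372 | -1918 | Spain |
| 5 | 46400 | 2020-05-26 | ES | 859 | 283 | Spain |
| 6 | 46376 | 2020-06-19 | ES | 307 | 1179 | Spain |
| 7 | 46322 | 2020-08-12 | ES | 3172 | -2 | Spain |
| 8 | 46238 | 2020-11-04 | ES | 25042 | 1623 | Spain |
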
  1. Items 1, 2: deaths from 2 days were probably recorded under one date
  2. Item 3: unusually high figure compared to nearby points
  3. Item 4: negative death count (-1918)
  4. Items 5, 6: unusually high figures compared to nearby points
  5. Item 7: negative death count (-2)
  6. Item 8: surge in death count; it can be attributed to 2nd wave impact, but to me it looks like a data glitch since it stands out from nearby points
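
The negative entries in the list above can be found mechanically. Below is a minimal sketch in plain Python, assuming the eight rows are typed in by hand as (DateRep, Cases, Deaths) tuples; the Cases and Deaths values are as I read them from the table, and the names `rows` and `negative_death_dates` are made up for this illustration, not part of the ECDC data set.

```python
# Rows for GeoId "ES", typed in by hand from the table: (DateRep, Cases, Deaths).
# Assumption: Cases and Deaths are split as read from the table in this post.
rows = [
    ("2020-04-27", 1660, 0),
    ("2020-04-28", 1525, 632),
    ("2020-05-22", 1787, 688),
    ("2020-05-25", -372, -1918),
    ("2020-05-26", 859, 283),
    ("2020-06-19", 307, 1179),
    ("2020-08-12", 3172, -2),
    ("2020-11-04", 25042, 1623),
]


def negative_death_dates(records):
    """Return the dates whose daily death count is below zero."""
    return [date for date, _cases, deaths in records if deaths < 0]


print(negative_death_dates(rows))  # ['2020-05-25', '2020-08-12']
```

A check like this only catches values that are impossible for a daily count. The unusually high figures (items 3, 5, 6 and 8) still need a comparison with nearby points.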

Conclusions

Items 3 to 7 from the above table combined (688-1918+283+1179-2) total 230. Recording this figure on 26th May and zeroing the existing entries could be a quick fix to the data, but the case requires a deeper investigation of how Spanish data are reported. This is especially important if we take into account the deaths surge reported on Nov 4th. It looks like a data collection glitch, but it may as well represent valid data resulting from the 2nd wave, so it definitely needs investigation.
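
The arithmetic behind the quick fix can be checked with a few lines of Python. This is only a sketch of the netting idea described above, not a recommended correction procedure; the names `deaths_items_3_to_7` and `quick_fix` are hypothetical and exist only for this example.

```python
# Daily death counts for items 3 to 7, keyed by DateRep.
deaths_items_3_to_7 = {
    "2020-05-22": 688,
    "2020-05-25": -1918,
    "2020-05-26": 283,
    "2020-06-19": 1179,
    "2020-08-12": -2,
}


def quick_fix(series, target_date):
    """Put the net total on target_date and zero every other entry."""
    net = sum(series.values())
    return {date: (net if date == target_date else 0) for date in series}


net_total = sum(deaths_items_3_to_7.values())
print(net_total)  # 230

fixed = quick_fix(deaths_items_3_to_7, "2020-05-26")
print(fixed["2020-05-26"])  # 230
```

The fix keeps the cumulative death total unchanged, since the sum before and after is the same, but it says nothing about the dates on which those deaths actually occurred.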

Added 2020-11-13:

It seems Spain has its own understanding of time series. The November spike comes from a restated definition of Covid-19 deaths. Why they post more than 1300 deaths that occurred prior to 11th May in November, together with current data (297 deaths on 2020-11-04), is hard to comprehend. Such an approach clearly distorts 2nd wave statistics. https://www.aa.com.tr/en/europe/spain-s-covid-19-death-toll-surges-by-1-623/2032447
