Exploratory Data Analysis
Counts of the reported symptoms for each year
Year over year, is the ratio of reported symptoms to reports (VAERS_ID) increasing, decreasing, or staying consistent?
—
SELECT YEAR(s_date) AS Year, COUNT(*) AS CNT_VAERS_ID, SUM(sym_cnt) AS Sum_sym_cnt
FROM dbo.[VAERSDATA]
GROUP BY YEAR(s_date)
ORDER BY 1
—
| Year | CNT_VAERS_ID | Sum_sym_cnt | Average_per_year |
|---|---|---|---|
| 1990 | 2,151 | 6,383 | 2.97 |
| 1991 | 9,992 | 29,421 | 2.94 |
| 1992 | 10,817 | 33,345 | 3.08 |
| 1993 | 10,309 | 35,089 | 3.40 |
| 1994 | 10,354 | 34,753 | 3.36 |
| 1995 | 10,273 | 35,039 | 3.41 |
| 1996 | 11,189 | 37,251 | 3.33 |
| 1997 | 11,608 | 38,172 | 3.29 |
| 1998 | 10,782 | 36,171 | 3.35 |
| 1999 | 12,878 | 45,113 | 3.50 |
| 2000 | 15,114 | 47,618 | 3.15 |
| 2001 | 14,633 | 49,910 | 3.41 |
| 2002 | 15,331 | 49,823 | 3.25 |
| 2003 | 18,082 | 59,393 | 3.28 |
| 2004 | 16,510 | 57,054 | 3.46 |
| 2005 | 17,447 | 62,117 | 3.56 |
| 2006 | 19,330 | 69,660 | 3.60 |
| 2007 | 30,813 | 112,443 | 3.65 |
| 2008 | 33,443 | 131,777 | 3.94 |
| 2009 | 37,042 | 150,424 | 4.06 |
| 2010 | 36,578 | 151,791 | 4.15 |
| 2011 | 31,126 | 133,325 | 4.28 |
| 2012 | 32,110 | 134,324 | 4.18 |
| 2013 | 36,262 | 146,757 | 4.05 |
| 2014 | 41,241 | 153,541 | 3.72 |
| 2015 | 51,954 | 187,941 | 3.62 |
| 2016 | 53,699 | 189,737 | 3.53 |
| 2017 | 46,344 | 174,683 | 3.77 |
| 2018 | 57,806 | 226,910 | 3.93 |
| 2019 | 53,846 | 213,335 | 3.96 |
| 2020 | 419 | 1,670 | 3.99 |
| Total (n) | 759,483 | 2,834,970 | 3.73 |
| Rows | 31 | | |
—
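The query returns only the two count columns; the Average_per_year column in the table appears to be Sum_sym_cnt divided by CNT_VAERS_ID for each year. A minimal Python sketch of that arithmetic follows, using three rows and the total row copied from the table. The names `rows` and `average_per_report` are illustrative and not part of the original analysis.

```python
# Rows copied from the table above: year -> (CNT_VAERS_ID, Sum_sym_cnt).
rows = {
    1990: (2151, 6383),
    2011: (31126, 133325),
    2019: (53846, 213335),
}


def average_per_report(report_count, symptom_count):
    """Reported symptoms per report, rounded to two decimals like the table."""
    return round(symptom_count / report_count, 2)


averages = {year: average_per_report(*counts) for year, counts in rows.items()}

# The "Total (n)" row: all symptoms divided by all reports.
overall = average_per_report(759483, 2834970)

for year, value in averages.items():
    print(year, value)
print("overall", overall)
```

The overall figure weights every report equally, so years with many reports count for more than years with few.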
Statistics on Average_per_year:
…
| Average per year | |
|---|---|
| Min | 2.9445 |
| Max | 4.2834 |
| Median | 3.5333 |
| Mean | 3.5867 |
| Standard Deviation | 0.3662 |
…
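These figures can be reproduced from the yearly counts in the first table. The sketch below uses Python's standard `statistics` module and rests on two assumptions: the statistics are taken over the 31 unrounded yearly ratios (an unweighted mean, which is why 3.5867 differs from the 3.73 in the total row), and the standard deviation is the sample form with an n - 1 denominator. The source does not state either choice.

```python
import statistics

# (CNT_VAERS_ID, Sum_sym_cnt) for 1990 through 2020, copied from the table.
yearly_counts = [
    (2151, 6383), (9992, 29421), (10817, 33345), (10309, 35089),
    (10354, 34753), (10273, 35039), (11189, 37251), (11608, 38172),
    (10782, 36171), (12878, 45113), (15114, 47618), (14633, 49910),
    (15331, 49823), (18082, 59393), (16510, 57054), (17447, 62117),
    (19330, 69660), (30813, 112443), (33443, 131777), (37042, 150424),
    (36578, 151791), (31126, 133325), (32110, 134324), (36262, 146757),
    (41241, 153541), (51954, 187941), (53699, 189737), (46344, 174683),
    (57806, 226910), (53846, 213335), (419, 1670),
]

# One unrounded ratio per year: symptoms divided by reports.
ratios = [symptoms / reports for reports, symptoms in yearly_counts]

summary = {
    "Min": min(ratios),
    "Max": max(ratios),
    "Median": statistics.median(ratios),
    "Mean": statistics.mean(ratios),
    # Assumption: sample standard deviation (n - 1 denominator).
    "Standard Deviation": statistics.stdev(ratios),
}

for name, value in summary.items():
    print(f"{name}: {value:.4f}")
```

Swapping `statistics.stdev` for `statistics.pstdev` gives the population form, which is slightly smaller.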
n values:
Unique VAERS_ID: 759,483
Total number of reported symptoms: 2,834,970
The mean (3.5867) and median (3.5333) are very close to each other.
The standard deviation (0.3662) is approximately one tenth of either value, so the year-to-year spread is small relative to the average.
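The "one tenth" comparison is the ratio of the standard deviation to the mean or median, often called the coefficient of variation. A small sketch of the arithmetic, using the three values reported above (the variable names are illustrative):

```python
# Values copied from the statistics table above.
mean, median, std_dev = 3.5867, 3.5333, 0.3662

# Standard deviation expressed as a fraction of each central value.
cv_mean = std_dev / mean
cv_median = std_dev / median

print(f"relative to mean:   {cv_mean:.3f}")
print(f"relative to median: {cv_median:.3f}")
```

Both ratios come out at roughly 0.10.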
—
Analysis:
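The average number of reported symptoms per VAERS_ID has stayed fairly consistent between 1990 and early 2020: the yearly value ranges from 2.94 (1991) to 4.28 (2011), with a mean of 3.59 across the 31 years. The 2020 row covers only 419 reports, so it reflects a partial year.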