Exploratory Data Analysis
Counts of the reported symptoms for each year
Year over year, is the ratio of reported symptoms to reports (symptoms per VAERS_ID) increasing, decreasing, or staying consistent?
—
-- Number of reports and total symptom count per year of s_date
SELECT YEAR(s_date) AS Year,
       COUNT(*)     AS CNT_VAERS_ID,
       SUM(sym_cnt) AS Sum_sym_cnt
FROM dbo.[VAERSDATA]
GROUP BY YEAR(s_date)
ORDER BY 1;
—
Year | CNT_VAERS_ID | Sum_sym_cnt | Average_per_year |
---|---|---|---|
1990 | 2,151 | 6,383 | 2.97 |
1991 | 9,992 | 29,421 | 2.94 |
1992 | 10,817 | 33,345 | 3.08 |
1993 | 10,309 | 35,089 | 3.40 |
1994 | 10,354 | 34,753 | 3.36 |
1995 | 10,273 | 35,039 | 3.41 |
1996 | 11,189 | 37,251 | 3.33 |
1997 | 11,608 | 38,172 | 3.29 |
1998 | 10,782 | 36,171 | 3.35 |
1999 | 12,878 | 45,113 | 3.50 |
2000 | 15,114 | 47,618 | 3.15 |
2001 | 14,633 | 49,910 | 3.41 |
2002 | 15,331 | 49,823 | 3.25 |
2003 | 18,082 | 59,393 | 3.28 |
2004 | 16,510 | 57,054 | 3.46 |
2005 | 17,447 | 62,117 | 3.56 |
2006 | 19,330 | 69,660 | 3.60 |
2007 | 30,813 | 112,443 | 3.65 |
2008 | 33,443 | 131,777 | 3.94 |
2009 | 37,042 | 150,424 | 4.06 |
2010 | 36,578 | 151,791 | 4.15 |
2011 | 31,126 | 133,325 | 4.28 |
2012 | 32,110 | 134,324 | 4.18 |
2013 | 36,262 | 146,757 | 4.05 |
2014 | 41,241 | 153,541 | 3.72 |
2015 | 51,954 | 187,941 | 3.62 |
2016 | 53,699 | 189,737 | 3.53 |
2017 | 46,344 | 174,683 | 3.77 |
2018 | 57,806 | 226,910 | 3.93 |
2019 | 53,846 | 213,335 | 3.96 |
2020 | 419 | 1,670 | 3.99 |
Total (n) | 759,483 | 2,834,970 | 3.73 |
Rows | 31 | | |
—
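The query returns only the first three columns. The Average_per_year values in the table are consistent with Sum_sym_cnt divided by CNT_VAERS_ID, rounded to two decimals. A minimal Python sketch of that arithmetic on a few rows copied from the table follows; the function name `symptoms_per_report` is hypothetical and used only for illustration.

```python
# (year, CNT_VAERS_ID, Sum_sym_cnt) copied from a few rows of the table
rows = [
    (1990, 2_151, 6_383),
    (1991, 9_992, 29_421),
    (2011, 31_126, 133_325),
    (2019, 53_846, 213_335),
]


def symptoms_per_report(report_count: int, symptom_count: int) -> float:
    """Average symptoms per report, rounded to 2 decimals like the table."""
    return round(symptom_count / report_count, 2)


# Per-year ratio for the sampled rows
averages = {year: symptoms_per_report(cnt, sym) for year, cnt, sym in rows}

# Same ratio for the "Total (n)" row
overall = symptoms_per_report(759_483, 2_834_970)

for year, avg in averages.items():
    print(year, avg)
print("Total", overall)
```

Note that the value in the Total row is the ratio of the two grand totals, so years with more reports weigh more heavily in it.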
Statistics on Average_per_year:
…
Statistic | Average_per_year |
---|---|
Min | 2.9445 |
Max | 4.2834 |
Median | 3.5333 |
Mean | 3.5867 |
Standard Deviation | 0.3662 |
…
n values:
Unique VAERS_ID: 759,483
Total number of reported symptoms: 2,834,970
The mean (3.5867) and median (3.5333) are very close to each other.
The standard deviation (0.3662) is approximately 1/10th of either, which corresponds to a coefficient of variation of about 0.10.
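The summary statistics can be recomputed from the yearly table with Python's standard `statistics` module. This is a sketch under two assumptions that the text does not state: the statistics are taken over the unrounded yearly ratios, and the standard deviation is the sample form (n - 1), which appears consistent with the reported 0.3662.

```python
import statistics

# year -> (CNT_VAERS_ID, Sum_sym_cnt), transcribed from the yearly table
yearly = {
    1990: (2151, 6383),     1991: (9992, 29421),    1992: (10817, 33345),
    1993: (10309, 35089),   1994: (10354, 34753),   1995: (10273, 35039),
    1996: (11189, 37251),   1997: (11608, 38172),   1998: (10782, 36171),
    1999: (12878, 45113),   2000: (15114, 47618),   2001: (14633, 49910),
    2002: (15331, 49823),   2003: (18082, 59393),   2004: (16510, 57054),
    2005: (17447, 62117),   2006: (19330, 69660),   2007: (30813, 112443),
    2008: (33443, 131777),  2009: (37042, 150424),  2010: (36578, 151791),
    2011: (31126, 133325),  2012: (32110, 134324),  2013: (36262, 146757),
    2014: (41241, 153541),  2015: (51954, 187941),  2016: (53699, 189737),
    2017: (46344, 174683),  2018: (57806, 226910),  2019: (53846, 213335),
    2020: (419, 1670),
}

# Unrounded symptoms-per-report ratio for each year
ratios = [sym / cnt for cnt, sym in yearly.values()]

summary = {
    "min": min(ratios),
    "max": max(ratios),
    "median": statistics.median(ratios),
    "mean": statistics.mean(ratios),
    "stdev": statistics.stdev(ratios),  # sample standard deviation (n - 1)
}
for name, value in summary.items():
    print(f"{name}: {value:.4f}")

# Pooled ratio over all reports: each year is weighted by its report count
total_reports = sum(cnt for cnt, _ in yearly.values())
total_symptoms = sum(sym for _, sym in yearly.values())
pooled = total_symptoms / total_reports
print(f"pooled: {pooled:.4f}")
```

The mean here gives every year equal weight, which is why it (3.5867) differs from the pooled 3.73 in the Total row of the table.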
—-
Analysis:
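The average number of reported symptoms per VAERS_ID has stayed fairly consistent between 1990 and early 2020: the yearly values range from 2.94 (1991) to 4.28 (2011) around a mean of 3.59. The 2020 figure is based on only 419 reports, so it covers a partial year.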