We are a group of researchers and analysts who are interested in data science and would like to use our expertise to contribute to the understanding of COVID-19 in our communities.
Looking for data…
One of the challenges we encountered in trying to understand the spread of COVID-19 was finding a data source in a format that is easily accessible for analysis. When we were unable to locate such a file, and found that scraping the data with R was too messy given the formats in which the information has been released, we decided to take a manual approach. Using a few different sources, we have compiled data tables that are easily accessible in R (our favorite) and Python.
…Compiling our own
March 19, 2020
A COVID-19 Google Sheet for Ontario cases has been created and is being maintained with data from an Ontario government website and resources available on two Wikipedia pages. We will continue to update these tables until a more authoritative source of case records is made available, ideally by Public Health Ontario.
Resources: An invitation to explore and dive deeper
As we explore this data we will be sharing visualizations and insights on the Data User Group website. Our hope is that others will find our summaries useful. We extend an open invitation to others interested in data science to engage in additional analysis and use this data set for your own exploration. Resources include:
- A GitHub repository has also been created. It will include our members' R code as well as a .csv file that will be updated regularly. The Google Sheet will serve as our authoritative data source and the GitHub repository as our central repository for code. We invite anyone who is interested to contribute to the repository.
- A Shiny app built from the GitHub code has been created to provide interactive exploration of the COVID-19 data by region.
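As a minimal sketch of how the regularly updated .csv file might be explored in Python, the example below counts cases per health unit. The column names (`case_number`, `health_unit`) and the sample rows are hypothetical placeholders, not the actual layout of the file in the repository. In practice, the inline sample would be replaced by the downloaded file.

```python
import csv
import io
from collections import Counter

# Hypothetical sample standing in for the repository's .csv file.
# Column names and rows are illustrative assumptions only.
SAMPLE_CSV = """case_number,health_unit
1,Toronto
2,Toronto
3,Peel
4,Durham
5,Peel
6,Toronto
"""


def count_cases_by_unit(csv_file):
    """Return a Counter of case counts keyed by health unit."""
    reader = csv.DictReader(csv_file)
    return Counter(row["health_unit"].strip() for row in reader)


# io.StringIO lets the sample text be read like an open file.
counts = count_cases_by_unit(io.StringIO(SAMPLE_CSV))
for unit, n in counts.most_common():
    print(f"{unit}: {n}")
```

Passing an open file object instead of `io.StringIO(SAMPLE_CSV)` would apply the same count to a real download, provided the column name is adjusted to match.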
Data Background and Sources
The “Provincial Reporting” tab in the Google Sheet is a compilation of data from this Ontario government website. This webpage provides a table on new cases of COVID-19 diagnosed in the province. Following are notes about the data:
- The earliest records that could be obtained using the Wayback Machine began at case 32.
- The first 31 cases were then compiled by parsing the press releases available at the bottom of the page.
- Currently, case numbers 6, 16, 17 and 18 have not been found in the available press releases.
- Coding with respect to regional health unit appears to have changed over time. A new column has been added with recoded health unit labels for consistency.
- On March 18th the website stopped listing the hospitals associated with each case.
- The data in the Google Sheet is updated daily from this website, at around 10:30 am and 5:30 pm, when new data is released.
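Two of the notes above, the case numbers that could not be located and the recoded health unit labels, lend themselves to simple checks. The Python sketch below finds gaps in a sequence of case numbers and maps variant labels onto one consistent label. The label variants in `RECODE` and the sample records are hypothetical. The recoding actually used in the Google Sheet is not described here.

```python
# Hypothetical mapping from label variants to one consistent label.
RECODE = {
    "toronto": "Toronto",
    "toronto public health": "Toronto",
    "peel": "Peel",
    "peel region": "Peel",
}


def missing_case_numbers(case_numbers):
    """Return the case numbers absent from 1..max(case_numbers), in order."""
    found = set(case_numbers)
    if not found:
        return []
    return [n for n in range(1, max(found) + 1) if n not in found]


def recode_health_unit(label):
    """Map a raw label to its consistent form; unknown labels pass through trimmed."""
    cleaned = label.strip()
    return RECODE.get(cleaned.lower(), cleaned)


# Illustrative records: (case number, raw health unit label).
records = [
    (1, "Toronto"),
    (2, "toronto public health"),
    (3, "Peel Region"),
    (5, "Peel"),
    (7, "Durham "),
]

gaps = missing_case_numbers(n for n, _ in records)
recoded = [recode_health_unit(label) for _, label in records]
print(gaps)
print(recoded)
```

Keeping the recoded value in a separate column, as the sheet does, preserves the original label so that the mapping can be audited or revised later.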
The “Wikipedia” tab in the Google Sheet is a compilation of data from Wikipedia’s Thematic Google Map. The map includes interactive summaries by region, which include the number of cases, the number of patients in local hospitals, and the buildings that have been impacted. Following are notes about the data published on this page:
- The data in the Google Sheet is updated daily from this website.
- Wikipedia also provides a table with time-series data for the spread of COVID-19 in each province. The data from this table is available in the “Wikipedia National” tab in the Google Sheet and is updated less frequently than the other two tabs.
A posting is now open until February 8, 2019 for a Research Analyst contract position at the Durham District School Board. The start date for this position is to be determined. The posting can be viewed and applied for through applytoeducation.com.
Click here for more information
Summary of the Research Analyst: This position will take a lead role in a project … as part of the Ontario Education Equity Action Plan.
Reports To: Administrative Officer, Accountability and Assessment.
A posting is now open until July 18, 2018 for a Research Officer position at the Peel District School Board. The position begins September 4, 2018. The posting is on applytoeducation.com (link at the bottom of this pdf, which has more information), so you will need to create an account before applying.
Click here for more information (link to apply at the bottom of the pdf)
The Peel District School Board (PDSB) is one of the largest school boards in Canada, with more than 150,000 students in over 250 schools. At PDSB, everything we do is designed to help all students achieve to the best of their ability. We have the incredible opportunity to inspire a smile in each student. Our collective, daily efforts make a positive difference in the lives of our students, their families and the world. Guided by our mission, vision and values, we build positive places for learning and working … together at http://www.peelschools.org. We are currently accepting applications for a Research Officer.
Are you an experienced professional highly skilled in qualitative and quantitative research? Do you welcome the opportunity to draw on this expertise to support services and programs across the Peel District School Board? If so, take the next step in your successful career by joining our team.
Job Duties/Responsibilities and Details
Reporting to the Chief Research Officer, Research and Accountability, you will work both independently and as part of a team of education researchers in the design, implementation and interpretation of research and evaluation projects to support the board’s system-wide strategic goals, equity and diversity initiatives, and curriculum and instruction programs.
Being a Research Officer at the Peel District School Board means acting as a research and evaluation resource to support the use of data for planning and decision-making. This will include being responsible for consultation and development of assessments (curriculum, alternative programs, special education) as well as the evaluation of educational programs (equity, diversity, instruction, special education). The research team and Board staff will also rely on your assessment of current educational trends, and on the literature reviews and environmental scans you can provide on topics of interest as they carry out their functions.
Over the past couple of years the Data User Group and Barrie Region MISA PNC have hosted a “Researcher Coffee Break”, a teleconference where school board employees in research, evaluation and data-related roles can connect to discuss current issues, challenges or new approaches to common data sets.
Given the popularity of these periodic teleconferences, the Data User Group is collaborating with the Association of Educational Researchers of Ontario for a day of networking and sharing:
Click here for the flyer which includes a link to the online registration.
In collaboration with the Barrie, Toronto and London MISA (Managing Information for Student Achievement) PNCs (Professional Network Centres), a Special Interest Group session has been scheduled for:
- Date: April 28th, 2015
- Time: 1:00-3:30 pm
- Location: Toronto District School Board
This session will explore two existing community data models, the Social Risk Index (SRI) and the Learning Opportunities Index (LOI). Topics of the session will include:
- construction of the different models
- examples of how the models are used in the context of a school board
- comparison of the similarities and differences between the models.
The afternoon will wrap up with a general discussion of the Environics data that has been collaboratively purchased by the Barrie, London and Toronto PNCs.
Please click here to register for the event. Additional details regarding the location are included at the end of the registration.
New to the R Resources page is the “Qualitative Analysis in R” pdf. This resource, prepared by Greg Rousell, is an overview of getting started with qualitative data analysis in R using the RQDA package. It walks the reader through Data Cleaning, Text Mining, Manual Thematic Coding, Auto-Coding Text, and Creating (exporting) a coded file, and includes both screenshots and sample scripts for ease of reference.
AERO has announced an upcoming Special Interest Group session “Working with Community Data Sets”.
This free event will be of interest to those who want to learn more about the approaches and development of vulnerability indices and the practical, technical and utilization issues associated with this work. The afternoon will feature:
- three socio-demographic models that are used by education researchers in Ontario;
- discussion of issues related to the 2011 census;
- presentation of R and QGIS tips and tricks for working with socio-demographic data.
If you are interested in attending or would like more information, send us a quick note and the registration information will be forwarded to you.
Click here for a pdf of the flyer.