EEB125H1 Data Science Conference#
The final project for this course will use data science methods including programming and statistical analysis of data on conservation risk and organismal biology. You will present your findings in the style of an oral presentation at an academic Data Science Conference (DSC). You and your team will give a short oral presentation of your findings, and be prepared to answer questions about your work.
Deliverables#
The Jupyter notebook (.ipynb) that produced the slides for the presentation, to be submitted by April 4, 23:59 (11:59 P.M. - Toronto time).
A 10 minute pre-recorded video presentation of your work.
Please see submission instructions below.
Submission Information#
Forming a Group#
First, select one student in the group to create the group. That student should:
Login to MarkUs.
Go to Proj_02: Course Project Phase 2 (Final Submission), and the
Submissionstab.Click on
Create Group.Then after your group is created, click on Invite student(s).
Enter the UTORids of each other group member (you’ll need to ask them for this information).
Then, every other student should:
Login to MarkUs.
Go to Proj_02: Course Project Phase 2 (Final Submission), and the
Submissionstab.Look for the group invitation and click on
Join Group. Warning: this must be done before the due date!!
Submitting Files#
Submitting your group’s files works the same way as other assignments, except only one person from the group needs to submit the files.
First, upload your video recording to the University of Toronto MyMedia platform, https://mymedia.library.utoronto.ca. Again, only one student in the group needs to do this.
After uploading the video, obtain the “Permanent Link” to the video but clicking on the “Share” button. This link should look something like
https://play.library.utoronto.ca/watch/1234567abcdefg...
Add this URL to the top of your group’s Jupyter notebook file, and clearly label it as the link to your presentation recording.
Then, submit your group’s Jupyter notebook file to MarkUs under Proj_02: Course Project Phase 2 (Final Submission).
Your TA will need to be able to see your full notebook file and access the video URL, so please double-check both of them before making your final submission!
Slides#
You should produce your slides using the Jupyter notebook Slideshow Extension RISE which is available on the UofT Jupyterhub. The template of a Jupyter notebook document to produce the slides for your presentation is here. You will be allowed to present ten content slides (i.e., does not include slides to break up sections described below).
Your presentation slides should include reproducible Python output (i.e., graphs, tables should be produced by Python code not hard-coded or inserted as an image), but not Python code unless it’s directly related to one of the sections below. See the template for an example of how to write code chunks to do this.
Your title slide must include the names of your team members, your tutorial section (e.g., TUTXXXX), and your group number as assigned by your TA.
Your slides should include the following sections:
Introduction
Data
Methods
Results
Conclusion
A few guidelines for an effective presentation:
Your slides should be clear, concise, and easy to read quickly.
Do not use small fonts as it will need to be read without zooming in.
Figures/plots/visuals often display information more efficiently than text.
Numbered or bulleted lists convey points in slides more effectively than blocks of text.
Video Presentation#
Your group will be asked to record a 10-minute video presentation summarizing your work. This time limit is firm and the grader will not watch beyond the 10-minute mark of any videos. Do not increase the video speed in order to squeeze it into the alotted time. Speech and pacing of the video should be digestible for the viewer/grader. Each team member must speak during this presentation.
Teamwork#
You will work in a group comprised of either 3 or 4 students in the same tutorial.
All team members are expected to contribute equally to the completion of the project. All team members must present part of the oral video presentation.
Your group may not work with members of another group. You may not discuss the project with anyone except for your team, professors, and the course TAs.
If you are concerned about any issues with your team, contact a member of the teaching team as soon as possible.
Evaluation#
Students will be evaluated as a team.
Grade component |
Value |
|---|---|
Content of slides |
50% |
Reproducibility of slide content |
10% |
Oral Video Presentation |
40% |
Reproducibility of Slides#
The rubric for the content of the slides evaluation is below.
import pandas as pd
slidesrubric = pd.read_csv('confrenceslidesrubric.csv', keep_default_na=False)
slidesrubric.style.hide_index()
/tmp/ipykernel_41961/3507230686.py:3: FutureWarning: this method is deprecated in favour of `Styler.hide(axis="index")`
slidesrubric.style.hide_index()
| Criteria | Category | Excellent | Good | Adequate | Poor |
|---|---|---|---|---|---|
| Content | Reasonable scope | The scope of the analysis is clear and questions can be fully addressed using the available data. | The scope of the analysis is clear and questions can be reasonably addressed using the available data. | The scope of the analysis is less clear, the questions can somewhat be addressed using the available data with slight modifications. | The questions are beyond the scope, cannot be reasonably addressed with the available data; need to resort to additional data or complete modification. |
| Data wrangling | Creative use of data wrangling to produce informative variables. | Appropriate use of data wrangling to create sensible variables. | Some use of data wrangling to create new variables. | No evidence of data wrangling to create any variables. | |
| Graphical display | Choice of graphs are appropriate and creative; graphs reveal useful information and tell a story. Meaningful captions and titles. | Choice of graphs are appropriate; graphs reveal useful information, but are not self-sufficient. Might require some explaining. | Choice of graphs are appropriate; graphs reveal some useful information. Might require some explaining and minor changes to titles/axes/labels, etc. | A lack of visual aid; graphs are inappropriate, reveal no information. | |
| Statistical methods | The choice of method is appropriate; analyses are complete; diverse and creative use of more than one approach. | The choice of method is appropriate; some non-essential analyses are missing | The choice of method is somewhat appropriate; some analyses are missing. | The choice of method is inappropriate; essential analyses are missing. | |
| Appropriate conclusion | Results are clearly and completely summarized. Appropriate limitations and concerned clearly stated. | Results are completely summarized. Some limitations and concerned are stated. | Some results are summarized. The conclusion is not appropriate and no mentioning of any limitations. | Results are not summarized and conclusion is missing. | |
| Writing | Organization | Contents are very well organized under the appropriate section and subsection headings. | Contents are organized under the appropriate section and subsection headings. | Contents are somewhat organized under section and subsection headings. | Contents are poorly organized under section and subsection headings. |
| Overall Writing | Very polished and well written. | Few errors in spelling, punctuation, and/or grammar. Mostly clear and understandable. | Partly unclear, but mostly understandable. Several errors in spelling, punctuation, and/or grammar. | Too many errors in spelling, punctuation, and/or grammar, which make it unclear and difficult to follow. |
Your TA will attempt to reproduce your slides using the Jupyter notebook (.ipynb) and data files you submit.
If your TA cannot run the .ipynb files you submit to reproduce your slides content then your group will receive 0; if the TA has to make minor changes to get it to run then your group will receive 1; and if it runs with no changes then your group will receive 2.
If the Jupyter notebook and other files are submitted after the deadline every member of your group will lose 10% of your overall final project mark as long as they are submitted at most 24 hours after the deadline. The project files will not be accepted more than 24 hours after the deadline.
Oral Presentation#
Your group will record a 10 minute video presentation about your work. The rubric for the presentation is below.
oralrubric = pd.read_csv('Oralpresrubric.csv', keep_default_na=False)
oralrubric.style.hide_index()
/tmp/ipykernel_41961/1392216112.py:2: FutureWarning: this method is deprecated in favour of `Styler.hide(axis="index")`
oralrubric.style.hide_index()
| Criteria | Excellent (Rare) | Good (Common) | Adequate (Common) | Poor (Very rare) |
|---|---|---|---|---|
| Preparedness | Extremely prepared and rehearsed. | Primarily prepared but with some dependence on written notes. | The presenter was not well prepared and sometimes reading off notes. | Evident lack of preparation/rehearsal. Complete dependence on notes. |
| Speech clarity | Words were articulated clearly and distinctly, and very easy to understand. | Words were articulated clearly and distinctly most of the time, easy to understand. | Clear attempts to enunciate, with some occasional mumbling, but still understandable. | A lot of word slurring or mumbling, barely understandable. |
| Content Clarity | Just the right amount of explanation and details were given, the presentation effectively achieved its points. | Sufficient explanation and details were given, the presentation achieved most points. | Some explanation, too little or too much details were given, the presentation achieved some points. | No explanation, insufficient or too many unnecessary details, the presentation was confusing and had no clear objectives. |
| Transitional Phrases | Effective use of words and phrases to enhance the flow and signal transitions. | Good use of words and phrases to control the flow and signal transitions. | Some use of transitional words and phrases to signal transitions. | Lack of transitions and a poor progression of flow. |
| Vocabulary | Accurate use of statistical terms and phrases, and the presentation was professional and polished. | Good use of statistical terms and phrases whenever necessary. | Demonstrated efforts to incorporate statistical terms and phrases, but some were used inaccurately. | Completely inaccurate and wrong use of statistical terms and phrases and signals a lack of understanding. |
| Delivery | Well-paced, good volume, there was eye contact and the presenter was confident. | Good pace and volume, there was some eye contact and the presenter seemed confident. | Pacing could be improved, volume or eye contact was not consistent. | Poor pacing, barely audible, a lack of eye contact. |
| The wow factor | Overall an excellent and impressive presentation. |
Every member of the team is expected to speak as part of the oral presentation. One way to do this is to have each team member present at least one slide.
If a student in a group speaks, but doesn’t present in their group’s content then they will receive 50% of the group mark.
If a student doesn’t speak at all during the presentation then they will receive 0.