Based on the videos and the reading material, how would you define a data scientist and data science? (3 marks)
Data science is a relatively new field that, at its base, is the study of data, particularly large data sets. It is a process in which data is collected, manipulated and analyzed using various tools in order to explore and understand a problem, answer questions and make recommendations. Data science is important because data is constantly being generated, does not expire, and can reveal trends, potential problems and opportunities in almost every field, with applications such as fraud detection, customer recommendations and weather prediction.
As discussed in the videos and the reading material, data science can be applied to problems across different industries. Briefly explain which industry you are passionate about and would like to pursue a data science career in. (2 marks)
Data science careers are diverse, and there are various roles within the field. The work is not just data analysis; it also involves machine learning, data engineering, data visualization, and more.
Based on the videos and the reading material, what are the ten main components of a report that would be delivered at the end of a data science project? (5 marks)
There are ten main components of a data science report: cover page, table of contents, abstract, introduction, literature review, methodology, results, discussion, conclusion, and appendices. First is the cover page, which provides the name of the report, the authors and their contact details, and the date of publication. The table of contents lists the main sections of the report. The abstract provides a brief explanation of the arguments explored in the report. The introduction presents the problem at an introductory level, allowing readers who may not be familiar with the topic to learn about it before delving into the heavier research. The literature review provides readers with relevant research that applies to the argument(s) being made. The methodology section is where you detail the research methods and sources used for your research. The results section is where you present your findings. The discussion section is where you explain your thesis and highlight your findings. The conclusion section is where you provide an overall summary of the results of your research and any recommendations you may have. The appendices are the last section, where you provide references and other data that would otherwise disrupt the flow of the report.