I’m working on a r project and need an explanation to help me understand better.Objective: Create a Statistical Analysis Plan (SAP) based on data obtained by web scraping using the rvest R package.Overview: You are required to obtain a dataset of interest to you by ‘scraping’ data from a website of your choosing (and for which you have permission) using the rvest R package.You will then pose at least 2 research questions that you could explore using the dataset.
You will then submit your data and a document that summarises your dataset, lists your research questions and includes a Statistical Analysis Plan (SAP) which outlines how you would explore your research questions.Task Details:
Task 1 – Obtain a dataset of interest to you by ‘scraping’ data from a website of your choosing using the rvest R package and identify two research questions that could be explored using the dataset.Before you scrape the data, ensure that you have permission by using the paths_allowed() function. Once imported into R, store your data as a tibble with a meaningful name and give the variables appropriate names and ensure the variables are the correct data types in R (i.e. character, numeric, integer, factor, logical, etc.).Export your data to an Excel (.xls) file.(The data should have between 50 and 500 observations. )
Task 2 – Produce a Word or PDF document containing: The URL (web address) of the dataundefined
The context of the data and it’s variablesundefined
The content of the data as summarised by the output of the str() function applied to the tibble containing the data
The research questions you want to explore using the data
A SAP for exploring the research questions using the data
There is a template for the document available on Moodle and your SAP must be no longer than 2 A4 sheets in length.