Fill This Form To Receive Instant Help
Homework answers / question archive / MAS8403 Project: Penguins! Submit your project report via Canvas by 23:45 on Sunday 24th October 2021
MAS8403 Project: Penguins!
Submit your project report via Canvas by 23:45 on Sunday 24th October 2021.
Your submission should be a single, coherent report submitted in PDF form. The page limit for the report is 8 pages. Any reports going over this limit will be penalised. This limit does not include the cover page, bibliography or appendix. You do not need to include an abstract/executive summary.
The Palmer Station located in the Palmer Archipelago on Anvers Island, Antarctica, has been monitoring the ecology of the Palmer Long-Term Ecological Research (LTER) study area for over 50 years. You can see what’s going on at the Palmer Station currently by clicking here. Being on Antarctica, naturally one of their keen interests is monitoring the local penguin population from which they record data in order to understand their population dynamics, responses to changing climate etc.
The palmerpenguins dataset contains data measured on 333 penguins from the Palmer Archipelago.
The variables observed are:
Install the palmerpenguins package and access the data
install.packages("palmerpenguins") # You only need to do this once library(palmerpenguins) data("penguins")
penguins = na.omit(penguins) # Removes missing rows
Run the following code to access your unique subset of the penguin dataset
my.student.number = 123456789 # Replace this with your student number set.seed(my.student.number)
my.penguins = penguins[sample(nrow(penguins), 100), ]
the object my.penguins now contains the data on your 100 penguins.
You are to produce a report which comprises of an exploratory data analysis of the data on your sample of 100 penguins. In this exploratory analysis you should consider the most appropriate graphical and numerical summaries for your data, along with appropriate measures of uncertainty on these numerical summaries.
Sexing (i.e. determining the sex) of a penguin can often be very difficult without causing distress to the penguin. Researchers at the Palmer station would like to be able to estimate the sex of a penguin from measurement data, thereby avoiding the need for invasive procedures. From your data, which variables appear to be the best at distinguishing between male and female penguins? How reliable do you think they would be at identifying the sex of a penguin?
One scientist wishes to compare the weights of male and female penguins. To do this they propose a statistic p∗ which is the probability that a randomly selected male from a sample will be heavier than a randomly selected female from the sample. In your report include the calculated value of p∗ for your sample of penguins. What does this mean for the weights of males vs females? What are the good/bad properties of using a statistic like p∗ to compare two groups?
The scientists also have several questions they’d like to investigate:
From your data, statistically evaluate each of these questions and state your conclusions as part of your report.
Reports will be marked on the university scale. Credit will be given for:
As such the marking scale (out of 50) will be: