From bee48e79ccae5f655fb4a54db341e6e333ace176 Mon Sep 17 00:00:00 2001
From: kayliebrehm <kbrehm1@avc.edu>
Date: Wed, 6 Jul 2022 01:43:35 +0000
Subject: [PATCH 1/5] Initial Q1

---
 gss2018.rmd | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/gss2018.rmd b/gss2018.rmd
index 4774400..7007036 100644
--- a/gss2018.rmd
+++ b/gss2018.rmd
@@ -1,7 +1,7 @@
 ---
 title: "General Social Survey"
-author: "Your Name"
-date: "Year 2020"
+author: "Kaylie Brehm"
+date: "Summer 2022"
 output: 
   html_document:
     number_sections: true

From 84cb45119fc60b95e753899e518af6205635756b Mon Sep 17 00:00:00 2001
From: kayliebrehm <kbrehm1@avc.edu>
Date: Wed, 6 Jul 2022 02:19:15 +0000
Subject: [PATCH 2/5] Q1 Method

---
 GrabData.R | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/GrabData.R b/GrabData.R
index f4cc802..4aeab06 100644
--- a/GrabData.R
+++ b/GrabData.R
@@ -12,4 +12,5 @@ mydata3 <- filter(mydata2,BIGBANG=="True" | BIGBANG=="False",EVOLVED=="True"|EVO
 GSSdata <- filter(mydata3,CAPPUN=="FAVOR"|CAPPUN=="OPPOSE",VOTE12=="Voted"|VOTE12=="Did not vote",VOTE16=="Voted"|VOTE16=="Did not vote") %>% droplevels()
 levels(GSSdata$VOTE12)[1] <- "voted12"
 levels(GSSdata$VOTE12)[2] <- "no in 12"
-rm(Gss,Gss1,mydata,mydata2,mydata3)
\ No newline at end of file
+rm(Gss,Gss1,mydata,mydata2,mydata3)
+

From 6d4ea24cf462dba43614591bb31034f8a6992c16 Mon Sep 17 00:00:00 2001
From: kayliebrehm <kbrehm1@avc.edu>
Date: Wed, 6 Jul 2022 02:23:07 +0000
Subject: [PATCH 3/5] Q1 Data

---
 gss2018.rmd | 26 +++++++++++++++++++++++++-
 1 file changed, 25 insertions(+), 1 deletion(-)

diff --git a/gss2018.rmd b/gss2018.rmd
index 7007036..d5de124 100644
--- a/gss2018.rmd
+++ b/gss2018.rmd
@@ -26,29 +26,53 @@ source("GrabData.R")
 The data in the dataframe GSSdata is from the 2018 General Social Survey. The first blocks of R-code has selected down a subset of the data to just 16 variables. It has further removed unwanted factor levels in much of the data. Examine the code in the GrabData.R file to see what it is doing. Some of the variables are categorical and others are numerical. Be sure to do a variable analysis before tackling each question.  
 First question - Is opinion on the death penalty (CAPPUN) independent of gun ownership (OWNGUN)?
 
+$H_0$ Opinion on death penalty is not independent on ownership of gun.  
+$H_A$ Opinion on death penalty is independent on ownership of gun. 
 
 ## Methods
 
 <!--Decide on your methods:  use "variable analysis" or other appropriate descriptors.  Make sure to choose at least one graphical method and at least one numerical method.!-->
 
-##Results
+Both are categorical variables, each with two levels. Owning a gun would be a yes or no. Opinion on death penalty would be for or against. The analysis technique we will use is CAT~CAT. The results will show a bar chart, some numerical values, a fisher exact test for odds, and a chi-square test of independence.
+
+
+## Results
 
 <!--Divide this section into two sub-sections:  One for your descriptive  results and one for your inferential results.!-->
 
 ### Descriptive Results
 
+
+
 #### Graphical Descriptive Results
 
 <!--Graphical results here.  Make sure to show your code.  Provide appropriate labels for axes, giving units if possible, and provide a good title for the graph, too.  Use the graphical results to describe the patterns if any that exist in the data as focused toward the research question!-->
 
+```{r}
+barchartGC(~CAPPUN + OWNGUN,data=GSSdata)
+barchartGC(~CAPPUN + OWNGUN,data=GSSdata, type="percent")
+```
+
 #### Numerical Descriptive Results
 
 <!--Numerical results go here. Use the numerical results to describe the patterns if any that exist in the data as focused toward the research question!-->
 
+```{r}
+table2 <- xtabs(~CAPPUN + OWNGUN, data=GSSdata)
+rowPerc(table2)
+colPerc(table2)
+```
+
 ### Inferential Results
 
 <!--State hypothesis clearly.  Make sure your discussion of the inferential test covers all the aspects that the test output produces, such as test statistic, p-value etc.  Make a decision about the null hypothesis, explain the assumptions on which the selected test/procedure was based, and why the chosen procedure satisfys the assumptions and is appropriate to answer the research question!-->
 
+```{r}
+chisq.test(table2)
+chisqtestGC(table2)
+fisher.test(table2)
+```
+
 #  Question 2
 
 <!--In this section you explain what you are trying to show.  Where did the data come from?  What is the research or other question you are trying to answer?!-->

From 1f8b5b4c450d19b4399e4eff813c9669925aed14 Mon Sep 17 00:00:00 2001
From: kayliebrehm <kbrehm1@avc.edu>
Date: Wed, 6 Jul 2022 02:59:47 +0000
Subject: [PATCH 4/5] Q1 Analysis

---
 gss2018.rmd | 25 ++++++++++++++++++++-----
 1 file changed, 20 insertions(+), 5 deletions(-)

diff --git a/gss2018.rmd b/gss2018.rmd
index d5de124..4e7e137 100644
--- a/gss2018.rmd
+++ b/gss2018.rmd
@@ -26,8 +26,8 @@ source("GrabData.R")
 The data in the dataframe GSSdata is from the 2018 General Social Survey. The first blocks of R-code has selected down a subset of the data to just 16 variables. It has further removed unwanted factor levels in much of the data. Examine the code in the GrabData.R file to see what it is doing. Some of the variables are categorical and others are numerical. Be sure to do a variable analysis before tackling each question.  
 First question - Is opinion on the death penalty (CAPPUN) independent of gun ownership (OWNGUN)?
 
-$H_0$ Opinion on death penalty is not independent on ownership of gun.  
-$H_A$ Opinion on death penalty is independent on ownership of gun. 
+$H_0$ Opinion on death penalty is not independent on ownership on gun.  
+$H_A$ Opinion on death penalty is independent on ownership on gun. 
 
 ## Methods
 
@@ -48,11 +48,21 @@ Both are categorical variables, each with two levels. Owning a gun would be a ye
 
 <!--Graphical results here.  Make sure to show your code.  Provide appropriate labels for axes, giving units if possible, and provide a good title for the graph, too.  Use the graphical results to describe the patterns if any that exist in the data as focused toward the research question!-->
 
-```{r}
-barchartGC(~CAPPUN + OWNGUN,data=GSSdata)
-barchartGC(~CAPPUN + OWNGUN,data=GSSdata, type="percent")
+We create two bar charts - one based on frequency and the other on percent.
+
+
+````{r}
+dd2 <- GSSdata %>% group_by(CAPPUN,OWNGUN) %>% summarize(count=n()) %>% mutate(prcnt=count/sum(count))
+# the group_by followed by summarize(count=n())
+basicC <- ggplot(dd2,aes(x=CAPPUN,y=count,fill=OWNGUN))
+basicC + geom_bar(stat="identity",position="dodge")
+#Now for percentage plot
+basicCC <- ggplot(dd2,aes(x=CAPPUN,y=prcnt*100,fill=OWNGUN)) 
+basicCC + geom_bar(stat="identity", position = "dodge")
 ```
 
+Based on the data, it is apparent that those who oppose capital punishment, are more likely to say no to gun ownership. In those that favor capital punishment, slightly more people say no to gun ownership.
+
 #### Numerical Descriptive Results
 
 <!--Numerical results go here. Use the numerical results to describe the patterns if any that exist in the data as focused toward the research question!-->
@@ -63,6 +73,8 @@ rowPerc(table2)
 colPerc(table2)
 ```
 
+The top data set shows percentages for each opinion on capital punishment in relation to opinion on gun ownership. About 70.97% of those who oppose capital punishment are against owning a gun. About 51.72% of those who favor capital punishment are also against owning a gun.
+
 ### Inferential Results
 
 <!--State hypothesis clearly.  Make sure your discussion of the inferential test covers all the aspects that the test output produces, such as test statistic, p-value etc.  Make a decision about the null hypothesis, explain the assumptions on which the selected test/procedure was based, and why the chosen procedure satisfys the assumptions and is appropriate to answer the research question!-->
@@ -73,6 +85,9 @@ chisqtestGC(table2)
 fisher.test(table2)
 ```
 
+If the opinion for capital punishment is dependent on opinion on gun ownership, then there is a difference, meaning it is not 50/50 equal results. The Chi-Square adds up this difference and subtracts what we would expect if the null hypothesis were true. The P-Value is the probability that the null hypothesis is true. The p-value of the chi square test is 0.02022. Since this p-value is under 0.05, I reject the null hypothesis due to it being so small. The p-value of the fisher exact test is 0.01651. Since this p-value is under 0.05, I once again reject the null hypothesis due to it being so small. The odds ratio was 2.271 which is 1.271 away from one. So the probability that capital punishment opinion is dependent on gun ownership opinion is 127%.
+
+
 #  Question 2
 
 <!--In this section you explain what you are trying to show.  Where did the data come from?  What is the research or other question you are trying to answer?!-->

From ff66e088cd601a41e50733a61b23b8c4231c2933 Mon Sep 17 00:00:00 2001
From: kayliebrehm <kbrehm1@avc.edu>
Date: Wed, 6 Jul 2022 03:54:42 +0000
Subject: [PATCH 5/5] Q1 Edit

---
 gss2018.rmd | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/gss2018.rmd b/gss2018.rmd
index 4e7e137..4ce2eca 100644
--- a/gss2018.rmd
+++ b/gss2018.rmd
@@ -85,7 +85,7 @@ chisqtestGC(table2)
 fisher.test(table2)
 ```
 
-If the opinion for capital punishment is dependent on opinion on gun ownership, then there is a difference, meaning it is not 50/50 equal results. The Chi-Square adds up this difference and subtracts what we would expect if the null hypothesis were true. The P-Value is the probability that the null hypothesis is true. The p-value of the chi square test is 0.02022. Since this p-value is under 0.05, I reject the null hypothesis due to it being so small. The p-value of the fisher exact test is 0.01651. Since this p-value is under 0.05, I once again reject the null hypothesis due to it being so small. The odds ratio was 2.271 which is 1.271 away from one. So the probability that capital punishment opinion is dependent on gun ownership opinion is 127%.
+If the opinion for capital punishment is dependent on opinion on gun ownership, then there is a difference, meaning it is not 50/50 equal results. The Chi-Square adds up this difference and subtracts what we would expect if the null hypothesis were true. The P-Value is the probability that the null hypothesis is true. The null hypothesis was "Opinion on death penalty is not independent on ownership on gun." The p-value of the chi square test is 0.02022. Since this p-value is under 0.05, I reject the null hypothesis due to it being so small. The p-value of the fisher exact test is 0.01651. Since this p-value is under 0.05, I once again reject the null hypothesis due to it being so small. The odds ratio was 2.271 which is 1.271 away from one. So the probability that capital punishment opinion is dependent on gun ownership opinion is 127%.
 
 
 #  Question 2