Data 8R Summer 2017 1

Intro to Visualizations Discussion 1: June 27, 2017

Similarity and Randomization

Austen is running an experiment to determine the effects of various drugs on sleeping patterns. He puts up flyers around Berkeley, and accepts the first 200 applicants into his study. Each applicant agrees to take either a sample of a drug or a placebo, without knowing which they are receiving. 1.1 Will Austen be able to conduct a randomized controlled trial? Why or why not?

1.2 Austen wants to test the effects of alcohol on sleep. He gives his treatment group drinks, but gives nothing to his control group. Is this a blind study? Why or why not?

Ani is a political scientist who wants to study whether democratic countries tend to have higher average incomes. 1.3 Will she be able to conduct a randomized controlled trial? Why or why not?

1.4 What might be some confounding factors in her experiment?

2

Intro to Visualizations

2

Drug Effectiveness

A pharmaceutical company wishes to measure the effectiveness of a new drug designed to lower blood pressure for people with hypertension. They perform a randomized controlled trial and are attempting to summarize the results. They have produced the plot below:

2.1 What is the purpose of this visualization? What is it trying to communicate?

2.2 What are some problems with this plot?

2.3 How might this data be better represented? What changes would you make?

Intro to Visualizations

3

Here’s an attempt to improve the plot, using the same data:

2.4 What changes were made? What is better about this plot compared to the previous one?

4

Intro to Visualizations

3

Intel Chips

The chips that are present in your computer contain electrical components called transistors. Intel is one of the leading manufacturers of these chips; they released the first chip for home computers in 1979. We’d like to visualize the improvements in chips that Intel has made since 1979. We have the following plot:

3.1 What is the plot trying to communicate?

3.2 What are the problems with this plot? (There are quite a few.)

Intro to Visualizations Here’s an attempt to improve the plot:

3.3 What is clear from this plot that isn’t clear in the previous one?

Here’s another attempt to improve the plot:

3.4 What is better about this plot compared to the previous one?

5

6

Intro to Visualizations

4

Cooking Oils

This plot on the next page shows information about common cooking oils. 4.1 What is the purpose of this visualization? What is it trying to communicate?

4.2 What does the visualization convey well? What are problems with this visualization?

4.3 Suggest an alternative visualization that would better communicate what the original visualization was going for.

Intro to Visualizations

7

Data 8R Intro to Visualizations Summer 2017 1 Similarity and ... - GitHub

Jun 27, 2017 - The chips that are present in your computer contain electrical components called transistors. ... Here's another attempt to improve the plot:.

342KB Sizes 0 Downloads 257 Views

Recommend Documents

Data 8R Tables and more Visualizations Summer 2017 1 ... - GitHub
Jul 11, 2017 - At the same time, the researcher also records the number of ... A business has graphed the proportion of outputs in each year as a bar chart.

Data 8R Tables and more Visualizations Summer 2017 1 ... - GitHub
number of colds each volunteer gets. Is this an observational ... questions about it. A business has graphed the proportion of outputs in each year as a bar chart.

Data 8R Table Methods and Functions Summer 2017 1 ... - GitHub
Jul 18, 2017 - We have the dataset trips, which contains data on trips taken as part ofa ... def num_long_trips(cutoff): ... We want to see what the distribution of.

Data 8R Table Methods and Functions Summer 2017 1 ... - GitHub
Jul 18, 2017 - Data 8R. Table Methods and Functions. Summer 2017. Discussion 7: ... its range - the difference between the highest value in the array and.

Data 8R Intro to Python Summer 2017 1 Express Yourself! 2 ... - GitHub
Mike had a tremendous growth spurt over the past year. Find his growth rate over this 1 year. (Hint: The Growth Rate is the absolute difference between the final.

Data 8R Intro to Python Summer 2017 1 Express Yourself! 2 ... - GitHub
An expression describes to the computer how to combine pieces of data. ... inputs to a call expression are expressions themselves, you can have another call ...

Data 8R Plotting Functions Summer 2017 1 Midterm Review ... - GitHub
Data 8R. Plotting Functions. Summer 2017. Discussion 7: July 20, 2017. 1 Midterm Review. Question 4 ... function onto the table: Hint: Velocity = distance / time.

Data 8R Plotting Functions Summer 2017 1 Midterm Review ... - GitHub
Jul 20, 2017 - In physics calculations, we often want to have the data in terms of centimeters. Create a table called cm table that has the original data and a ...

Data 8R Hypothesis Testing Summer 2017 1 Terminology 2 ... - GitHub
Jul 27, 2017 - simulated on a computer. ... From the histogram, it looks like the higher mean from gambling was not at all that unusual - it certainly could have.

Data 8R Review of Table Methods Summer 2017 - GitHub
Jul 18, 2017 - We find that most trips have smaller length, but a few are very long. We want to see what the distribution of commute lengths looks like, and ...

Data 8R Review of Table Methods Summer 2017 - GitHub
Jul 18, 2017 - We also figure that commuters will be subscribers to the program, not one-time users. ... return np.mean(short_commute.column( Duration ) ...

Data 8R Data Types and Arrays Summer 2017 1 A Test of Skill - GitHub
1 A Test of Skill ... errors) of the following lines of Python code! >>> 6 / 3 ... Luckily, they've supplied a function named bar(labelArray, dataArray) to do.

Data 8R Data Types and Arrays Summer 2017 1 A Test of Skill - GitHub
Data 8R. Data Types and Arrays. Summer 2017. Discussion 4: July 6, 2017 ... Impress the squirrels with your knowledge of data types! .... 4 Data Manipulation.

Intro to Webapp - GitHub
The Public Data Availability panel ... Let's look at data availability for this cohort ... To start an analysis, we're going to select our cohort and click the New ...

lecture 3: more statistics and intro to data modeling - GitHub
have more parameters than needed by the data: posteriors can be ... Modern statistical methods (Bayesian or not) .... Bayesian data analysis, Gelman et al.

Intro to Webapp IGV - GitHub
Home Page or the IGV Github Repository. We are grateful to the IGV team for their assistance in integrating the IGV into the ISB-CGC web application.

Intro to Google Cloud - GitHub
The Cloud Datalab web UI has two main sections: Notebooks and Sessions. ... When you click on an ipynb file in GitHub, you see it rendered (as HTML).

Intro to Google Cloud - GitHub
Now that you know your way around the Google Cloud Console, you're ready to start exploring further! The ISB-CGC platform includes an interactive Web App, ...

Intro to Webapp SeqPeek - GitHub
brought to you by. The ISB Cancer Genomics Cloud. An Introduction to the ISB-CGC Web App SeqPeek. Page 2. https://isb-cgc.appspot.com. Main Landing ...

Intro to Google Cloud - GitHub
known as “Application Default Credentials” are now created automatically. You don't really need to click on the “Go to. Credentials”, but in case you do the next ...

Innovative Projects Summer 2017 - GitHub
Jan 31, 2017 - 10. Page 2 http://nokiawroclaw.pl/ https://github.com/nokia-wroclaw/ .... Develop a tool that will notify person via android app that some system ...

Reactive Data Visualizations - Semantic Scholar
of the commercial visualization package Tableau [4]. Interactions within data visualization environments have been well studied. Becker et al. investigated brushing in scatter plots [5]. Shneiderman et al. explored dynamic queries in general and how

intro slides - GitHub
Jun 19, 2017 - Learn core skills for doing data analysis effectively, efficiently, and reproducibly. 1. Interacting with your computer on command line (BASH/shell).

lecture 2: intro to statistics - GitHub
Continuous Variables. - Cumulative probability function. PDF has dimensions of x-1. Expectation value. Moments. Characteristic function generates moments: .... from realized sample, parameters are unknown and described probabilistically. Parameters a