R and the World Cup
Across the street at the Revolution blog a nice example of using R with data from the cloud (see another post on this topic here) shows us the distribution of fouls during the just finished World Cup in a nice barchart. Even more interesting than the fact that Holland rules this category is the way the data are collected from a Google spreadsheet page.
With the following simple code line:
teams <- read.csv("http://spreadsheets.google.com/pub?key=tOM2qREmPUbv76waumrEEYg&single=true&gid=0&range=A1%3AAG15&output=csv")
We can read a specific part from a spreadsheet hosted on Google into our local R environment. Some deatils: “&gid=” (sheet number) and “%range=” (cell ranges: A1%3A ) and “&output=csv” to download in CSV format.
With some more lines, using the awsome
<br /> library(qqplot2)<br /> FOULS=t(DF2)[,c('Fouls')]<br /> qplot(names(FOULS), as.numeric(FOULS), geom="bar", stat='identity', fill=Fouls) + xlab('Country') + ylab('Fouls') + coord_flip() + scale_fill_continuous(low="black", high="red") + labs(fill='Fouls')
Two things to note:
c('Fouls') is a handy way to address columns in a R data frame by name
scale_fill_continuous(low="black", high="red") takes care of the color coding of the bars in reference to the number of fouls
Easy and straight forward - ah - great job Spain 🙂 !!