Getting your Trinity Audio player ready...

Data are everywhere. With today’s powerful smartphones, tablets and computers anyone, from school children to seniors, can learn how to analyze and make graphics from data. Maybe you want to take a closer look at financial data from Intuit’s Quicken or Mint, organize answers to a survey, understand government statistics better, predict what your customers will buy or follow stock prices.

Spreadsheets, like Microsoft Excel or Apache’s Open Office Calc (free), have statistical and graphing functions as a start. If you want to go beyond a spreadsheet and organize data into groups, plot many graphs at once or build statistical models then it’s worth learning the rapidly growing language statistical R. Modules let you integrate R with spreadsheets.

During a three-month break from writing this blog, I took a couple of online courses that used R:
– Johns Hopkins University’s Coursera course “Computing For Data Analysis” (rebranded R Programming with a new session starting May 5)
Stanford University’s online course, “Statistical Learning”

In the mid-1970s, John Chambers at Bell Laboratories created the S language to analyze statistics. It launched on the UNIX operating system in 1978. That’s before VisiCalc, the first spreadsheet, that appeared for the Apple II in 1979.

In 1996, Robert Gentleman and Ross Ihaka from the University of Auckland, announced their development of the R language, inspired by S. They attracted collaborators worldwide to build the powerful R software that’s available today.

R is freely available, open source software with many components. Today there are 5449 available packages on the leading CRAN software library, for everything from insightful graphics, to biomedical data analysis to financial forecasting.

Palo Alto’s Tibco offers a commercial version of S, S+, as well as TERR (Tibco Enterprise Runtime for R). Revolution Analytics, headquartered in Mountain View, was founded in 2007 to sell R software and services to commercial users. Recently, the company was named an Advanced Analytics “Visionary” company by analyst firm Gartner. Gartner estimates advanced analytics to be a $2 billion market that spans a broad array of industries globally. Revolution Analytics, lets you run R in Amazon’s cloud for data sets as large as a terabyte.


Source: Bay Area useR Group logo

Last week, I attended the monthly Bay Area useR Group Meetup, for those interested in R. The April meeting was at Intuit. There were 7 excellent speakers, but I’m only going to write about one – Ram Narasimhan. Ram teaches R for UCSC Extension. He gave an amusing talk on analyzing weather data – this link shows several examples of how he plotted weather data.

It started a couple of years ago, when he moved to Silicon Valley and his wife thought the weather was better in Chicago than in Sunnyvale. Ram decided he’d write some R software to test this. As averages can be misleading (see for example Stanford Consulting Professor Sam Savage’s short talk on the Flaw of Averages), Ram selected the minimum temperature, as long as it was over 50°F. He ended up writing a package ? weatherData – to gather weather station data for different locations and date ranges.


City Mean, Maximum and Minimum Temperatures, by Month for Austin, Las Vegas, San Diego, San Francisco and Tampa (Source: http://ramnarasimhan.wordpress.com/tag/weather-analysis/)

One weather data source is Weather Underground which shows the personal weather station ? Mountain Shadows KCAMOUNT15 ? in Mountain View. This site shows temperature, dew point, wind speed, pressure and precipitation starting in 2008. For an organizational site, Moffett Field’s station KNUQ gives weather data on Weather Underground as far back as 19:00hrs on March 2nd 1945. Comparing the two sites, you can see that temperatures can vary, even within a small town, given the marine effect near Moffett.

Playing with R gives insights into the challenges of using weather station data to see if Mountain View is warming or cooling. Which station should you use? Most days, at Moffett, the weather data is collected approximately once an hour, but other days there are more observations. Do you want to include night temperatures, or just day temperatures? Whereas climate models show the world in general is warming, climate is not the same as weather, which can jiggle around from day to day. You can read more about climate on NASA’s website.

If you want to learn about statistics, probability and survey sampling, then Stat Trek is a useful site. To get started with R, download the free RStudio development environment which comes with training and help files. Type in a sum ? “5+3” ? and the answer “8” will appear. At a minimum RStudio can be used as a calculator. To analyze the local weather data from Moffett Field from April 1st, 2005 to April 8th, 2005. Open RStudio, install the package weatherData from CRAN, then type in:
mvweather
followed by:
summary(mvweather) to get a summary of temperature data
or you could type:
mvweather
to see a list of temperatures.

Being interactive, R provides a fun way to learn statistics and quickly visualize data.

Most Popular

Join the Conversation

45 Comments

  1. Thanks a lot for nice article. Thanks for introducing Mr.Ram Narasimhan here with his work. Its really great to know how R language will help us to streamline most complex data into easy-read one. Thanks again.

  2. Angela, thanks for the great article. That was a fun meeting, and the Bay Area useR Group Meetup is routinely one of the more positive and interesting Meetups I attend.

    I also like the R training steps you shared. In that direction I would like to alert readers to a new book by Nina Zumel and myself (John Mount): Practical Data Science with R ( http://www.manning.com/zumel/ ). This book is about how to do data science (defining problems, working with others, analyzing data, presenting results, and deploying decision models to production) using R. The book ends up working through about 10 substantial examples and is a good tour through a lot of the tasks required of a data scientist (though by staying in R we avoid big data and a lot of other tools; all of which are good follow-up topics).

  3. this is very good nice article. this is very useful for R Language </a> students.

    ==========================================================
    This is very nice article. This is very use ful for R Language Learners.

    ===========================================================

    hi sir. i want to do R Language training.
    R Language </a> ONlINE TRAINING Thanks for providing valuable information.

  4. One should note here that virginia state police gun background check, open records act texas public information act and inmates information. In any case, indian river county public records marriage, deming new mexico public records or tx dps secure criminal history. In addition to this bank of america background check for employment more preferably than run background check on tenant.

  5. station should you use? Most days, at Moffett, the weather data is collected approximately once an hour, but other days there are more observations. Do you want to include night temperatures [url-http://www.behance.net/gallery/49091493/March-Calendar]july calendar 2017[/url]

  6. Thanks for sharing such an important article, keep updating us.

    Leading networking company CISCO launched a service called CISCO SUPPORT service to help its users and it also has a toll free number to provide instant help.

    For any problem regarding networking go to
    http://supportnumbers.net/

  7. thanks for introducing r language by Mr. ram Norishima, your post about R language give an idea about forecasting and graph also

    Norton support number

  8. Want to use premium Android apps for free? Now it is possible to remove license verification from any apps and games that you want! All you need a small app for Android named Lucky Patcher.

    Its very easy to use the app. But remember you have to root your Android smartphone to remove license verification from apps or games.
    Lucky Patcher Version 6.5.6 :
    Fix bug to Proxy server for Google Play.
    Update “About” and convert links to clickable.
    Translations updated.
    Custom patches updated.
    Bugs fixed.
    Lucky Patcher Version 6.5.5 :
    Fix bug to Xposed module LP.
    A new extension (lpzip) has been added. It is a zip archive, which can contain apk files and custom patches. Apk will be installed. Custom patches will be automatically imported into Lucky Patcher.
    Custom patches updated.
    Bugs fixed.
    Translations updated.

    https://luckypatcherapklatest.com/download-lucky-patcher-apk-for-android/
    https://luckypatcherapklatest.com/

  9. Having a Gmail account is almost a prerequisite today. Yes, there are other email facilities and then there is Google! The benefits of a Google account far exceed those provided by Gmail; you get access to all the apps offered by Google. But you need not use all of them if you do not want to; using just Gmail is also possible.When you sign up for Gmail, you are automatically eligible to use all the other services of Google. You have to activate some of them, while others are automatically activated for you.When you sign up Gmail, you are asked basic questions. Not all of these questions need to be answered mandatorily, but it is recommended that you answer all of them so that Google can optimize your Gmail experience.
    http://helpgmaillogins.com/www-gmail-com-sign-up-in-login/
    http://helpgmaillogins.com/
    http://helpgmaillogins.com/how-to-delete-gmail-account-permanently/

Leave a comment