CSI 991 Section X01
Contacts:
csutton@gmu.edu
jgentle@gmu.edu
Some links on this page are available only to members of the seminar group.
Members Enter |
The seminar will be conducted in the form a workshop. Participants are encouraged to put R on their own laptops, and to bring them to the seminars.
The main reference text is
R Graphics,
by Paul Murrell.
We will work through the examples in the text, approximately one chapter
per week, beginning with Chapter 2. The code for the graphs in the
text are available at
the author's website.
We want to understand everything about the example graphs, so hopefully
someone will have figured out all of the details before the class
presentation.
We will also discuss methods of incorporating graphs in TeX documents, and the R code necessary to produce other interesting graphs, such as those in The Elements of Statistical Learning, Data Mining, Inference and Prediction by Hastie, Tibshirani, and Friedman.
There are a number of useful books on R graphics and R in general. A list of books is available at the "Books" link on the main webpage for the R Project.
Frank Harrell has a very useful website on S/R resources. One of the links at that site is to a very useful introductory manual on S (and R) by Carlos Alzola and Harrell.
Students should consider participating in the 2006 UCSD Student Data Mining Competition.
We will occasionally discuss various issues in statistical learning. There is an interesting article in the February, 2006, issue of Statistical Science (published by IMS, and available online to members), Classifier technology and the illusion of progress, by David Hand, with discussion.
We will also work on various exercises using R graphics. Exercises can be suggested by anyone, and all participants are encouraged to submit exercises.
An important characteristic of a graph is the aspect ratio. The function eqscplot() in MASS helps to control this.
The symbols for mathematical objects should be used consistently in a document. This means that labels in graphical displays should use fonts similar to those used in the text of the document. In R graphics this may require expression() and italic(). Other R expression tokens are similar to TeX without an escape character, e.g., sqrt(), hat(), theta, Theta, etc. Some tokens are available only in one font family, e.g. theta and Theta are always upright.
Use "help(plotmath)" to see a description of the characters.
Use "demo(plotmath)" to see a demo.
The general syntax for producing text follows that of C; for example, "\n" gives a new line, "\"" gives a double quote mark, etc. Some of the old C escape sequences are irrelevant, such as "\?", and some don't work, such as "\b".
Some functions are platform-dependent, such as savefile() which is available in MS Windows.
We will begin by clearing up several issues from Chapter 2, such as the orientation of the axes labels, the placement of plots in mfrow(), and even some simple things, like what is the bannerplot, and what is going on with barplot() in the examples.
We will then proceed into Chapter 3, where the placement of plots will
be considered more systematically.
Presentation by Johan Bjursell.
Continuation of material in Chapter 3.
Presentation by Ewa Gradzka.
Continuation of material in Chapter 3.
Presentation by Keith Roberts.
3-D graphics
Grid graphics: Chapter 4.
Presentation by Johan Bjursell.
Continuation of material in Chapter 4.