CSI 991
Seminar in Computational Statistics:
Data Mining
Spring/Summer, 2005
Fridays 3:00pm -- 5:00pm, Innovation Hall, Room 133
CSI 991
Spring: Section 004
Summer: Section X01
Contacts:
csutton@gmu.edu
jgentle@gmu.edu
The main reference text will continue to be
The Elements of Statistical Learning
by Hastie, Tibshirani, and Friedman (HTF).
Some links on this page are available only to members of the seminar group.
Schedule
- Jan 21
- Jan 28
- Feb 4
- Feb 11
- Feb 18
Cliff's Notes
- Feb 25
Cliff's Notes
Li's work on the experiment mentioned on p17 of HTF
- Mar 4
Cliff's Notes
Discuss the Boston housing data:
Jill's writeup
training data
test data
- Mar 11
Cliff's Notes
- Mar 18
Spring Break
- Mar 25
Presentation by Nan Chen on regression strategies for classification.
- Apr 1
Presentation by Muna on impact of similarity measures on clustering.
Reference: paper by Strehl, Ghosh, and Mooney.
Follow-up discussion of Nan's presentation.
- Apr 8
Further discussion of Nan's regression study.
Presentation by Jill on an alternative method of regression.
Further discussion of Strehl et al. paper on impact of similarity measures;
Cliff's Notes.
- Apr 15
Chapter 4 of HTF
Presentation by Li Li comparing effectiveness of some methods of Chapter 4
with other methods she previously studied.
- Apr 22
Announcement
Seminar by Andreas Buja, 10:30 am, SUB II, room 4
Degrees of Boosting -- A Study of Loss Functions for Classification and Class Probability Estimation
Preliminary version of paper
- Apr 29
- May 6
- May 13
No meeting.
- May20
Discussion of election data (from Harrell).
- May 27
More experimentation with the election data.
Li Li has made two subsets of this data:
"y" is response variable in both sets.
- June 3
- June 10
No meeting.
- June 17
- June 24
Cliff's Notes