Department of Statistics Seminar
North Carolina State University
presents
Laine
Elliott, Dhruv Sharma, Weihua
Cao
A SAMPLER BY NCSU STATISTICS STUDENTS
1. Speaker: Laine Elliott
Title: Adjusting
for Measurement Error by Matching Moments with the Latent Variable of Interest
Abstract: A variety of complications arise when imperfect
measurements, W, are observed in place of a true variable of interest, X. When interest focuses on estimation of the density of
X, this presents an obvious problem, because the mis-measured
data do not have the same distribution as the latent variable X. In the
context of linear and non-linear regression models where X is a covariate,
regression parameter estimators obtained when W is substituted for X may be
substantially biased, and statistical power may be compromised. The
effects of measurement error have been thoroughly studied. Many strategies
for correcting for measurement error depend on the specific modeling or
regression context and can be intractable in highly non-linear models. In
our work, we take an alternative perspective focused on re-creating the
distribution of X from the observed W, either as the primary quantity of
interest or as a means to improving parameter estimation. We obtain estimates
of X for which the first M sample moments are unbiased for the corresponding
moments of X. We investigate the benefit of substituting these estimates for X
in density estimation and a variety of non-linear regression models. This
method has the advantage that once the estimates of X are obtained, they can be
substituted in any model, including complicated non-linear models.
Regression parameter estimators that depend primarily on M moments will be
virtually unbiased. I will describe some of the results from this
work.
*************
2. Speaker: Dhruv Sharma.
Title: Simultaneous Variable
Selection and Clustering for SVMs
Abstract:
Variable selection for
high-dimensional data analysis offers unique challenges, particularly when
there are many redundant noise features and high collinearity.
Support Vector Machines have been successfully used to study classification
problems. In this paper we propose the PACS algorithm, a new procedure to
perform automatic variable selection for classification problems under such
conditions. The procedure simultaneously selects variables while grouping them
into predictive clusters, further aiding in the investigation of groups of
features with similar behavior. The procedure is shown to compare favorably
with existing methods in terms of both classification accuracy and dimension
reduction, while yielding the additional group structure information.
******************
3.
Speaker: Weihua Cao
Title: Improving
Efficiency and Robustness of the Doubly Robust Estimator for a Population Mean
with Incomplete Data
Abstract:
Considerable recent
interest has focused on so-called doubly robust estimators for a population
mean response in the presence of incomplete data, which involve models for both
the propensity score and the regression relationship between outcome and
covariates. The usual doubly robust estimator may yield severely biased
inferences if neither of these models is correctly specified and can exhibit nonnegligible bias if the estimated propensity score is
close to zero for some observations. We propose alternative doubly robust
estimators that achieve comparable or improved performance relative to existing
doubly robust estimators, even with some estimated propensity scores close to
zero.
Friday, September 12, 2008,------3:35:4:35
pm,-------321 Riddick
Refreshments will be served in the Riddick Reading
Room at 3:00pm.***NOTE:*** No
food or drink is allowed in any of the classrooms in Riddick Hall.