Vita of Susmita Datta    (last updated July, 2008)                                                        


 

Address:

 

Department of Bioinformatics & Biostatistics

School of Public Health and Information Sciences

University of Louisville

Louisville, KY 40292

(502) 852 0081 (phone)

(502) 852 3294 (fax)

 E-mail: susmita.datta@louisville.edu


Education:

            Dissertation Title: Dynamics of Cytonuclear Disequilibria and Related Statistical Tests for The Neutrality of Mitochondrial DNA markers for Hybrid Zone Data   (under the direction of Prof. Jonathan Arnold, Department of Genetics, University of Georgia, Athens)


 Positions Held:

 


Research Interests:

Bioinformatics, Proteomics, Infectious Disease Modeling, Statistical Genetics, Statistical Issues in Population Biology, Survival Analysis.


Professional/Editorial:

 Honor: 

  Member:  

  Editorial Services:  

  Reviewer: 

  Other:


Publications:

    Refereed Publications:  

  1. Datta, S., Fu, Y. X., Arnold, J. (1996). Dynamics and equilibrium behavior of cytonuclear disequilibria under genetic drift, mutation, and migration, Theoretical Population Biology, 50, 298-324.

 

  1. Datta, S. and Arnold, J. (1996). Diagnostics and a statistical test of neutrality hypothesis using the dynamics of cytonuclear disequilibria, Biometrics, 52, 1042-1054.

 

  1. Datta, S., Rand, D. M., and Arnold, J. (1996). A statistical test of a neutral model using the dynamics of cytonuclear disequilibria, Genetics, 144, 1985-1992.

 

  1. Longini, I. M., Datta, S., and Halloran, E. (1996). Measuring vaccine efficacy for both susceptibility to infection in infectiousness for prophylactic HIV-1 vaccines, Journal of Acquired Immune Deficiency Syndromes and Human Retrovirology, 13, 440-447.

 

  1. Datta, S., Longini, I. M., and Halloran, E. (1997). Measuring vaccine efficacy for different HIV vaccine trials, Statistics in Medicine, 17, 185-200.

 

  1. Datta, S. and Arnold, J. (1998). Dynamics of cytonuclear disequilibria in subdivided populations, Journal of Theoretical Biology, 192, 99-111.

 

  1. Scribner, K. T., Datta, S., Arnold, J., and Avise, J. C. (1999). Empirical evaluation of cytonuclear models incorporating genetic drift and tests for neutrality of mtDNA variants: data from experimental Gambusia hybrid zones, Genetica, 105, 101-108.

 

  1. Datta, S., Halloran, E. M. and Longini, I. M. (1999). Efficiency of estimating vaccine efficacy for susceptibility and infectiousness: randomization by individual versus household, Biometrics, 55, 792-798.

 

  1. Datta, S. (2000). Some statistical aspects of cytonuclear disequilibria. In Statistics in Molecular Biology and Genetics, Ed: Francoise Seillier-Moiseiwitsch, IMS Lecture Notes-Monograph Series, 33, 21-37.

 

  1. Datta, S., Satten, G. A. and Datta, S. (2000). Nonparametric estimation for the three stage irreversible illness-death model, Biometrics, 56, 841-847.

 

  1. Datta, S. (2000). Some statistical issues involving multi-generation cytonuclear data, In Advances on Methodological and Applied Aspects of Probability and Statistics, N. Balakrishnan, Ed., Gordon and Breach, 525-546.

 

  1. Datta, S., Satten, G. A. and Datta, S. (2000). Estimation of stage occupation probabilities in multistage models, In Advances on Theoretical and Methodological Aspects of Probability and Statistics, N. Balakrishnan, Ed., Gordon and Breach, 493-506.

 

  1. Datta, S. (2001). Estimation of selection parameters using multi-generation cytonuclear data, Biometrical Journal, 43, 219-233.

 

  1. Datta, S. (2001). Exploring relationships in gene expressions: A partial least squares approach, Gene Expression, 9, 257-264.

 

  1. Datta, S. (2001). Testing neutrality of mtDNA using multigeneration cytonuclear data, Selected Proceedings of the Symposium on Inference for Stochastic Processes, Eds.: I. V. Basawa, C. C. Heyde and R. L. Taylor,  IMS Lecture Notes - Monograph Series, 37, 173-184, IMS, Beachwood, OH.

 

  1. Datta, S. and Arnold, J. (2002). Some comparisons of clustering and classification techniques applied to transcriptional profiling data. In Advances in Statistics, Combinatorics and Related Areas, Eds.: C. Gulati, Y-X. Lin, S. Mishra, and J. Rayner, World Scientific, 63-74.

 

  1. Datta, S. (2003). Statistical techniques for microarray data: A partial overview, Communications in Statistics-Theory and Methods, 32, 263-280.

 

  1. Datta, S. and Datta, S.(2003) Comparisons and validation of statistical clustering techniques for microarray gene expression data,  Bioinformatics19,  459-466 (2003).  Web Supplement

 

  1. Arnold, J., Schuttler, H.-B.,Logan, D., Griffith, J., Arpinar, B. Datta, S., Kochut, K. J., Kraemer, E., Miller, J. A., Sheth, A., Aleman-Meza, B., Doss,  J., Harris, L. and Nyong, A. (2003).  Metabolomics,  In Handbook of Industrial Mycology, Chapter 23. Marcel-Dekker, New York, NY, (2003).

 

  1. G., Brehm, S., Datta, S., and Adams, M. W. W. (2003). Whole Genome DNA microarray of a hyperthermophile and an archaeon: Pyrococcus furious grown on peptides and carbohydrate, Journal of Bacteriology, 185, 3935-3947.

 

  1. Datta, S.,  Satten, G. A., Benos, D. J., Xia, J.,  Heslin, M., and Datta, S. (2004). An empirical Bayes adjustment to increase the sensitivity of detecting differentially expressed genes in microarray experiments, Bioinformatics, 20, 235-242.

 

  1. Datta, S. and Datta, S. (2004). An empirical Bayes adjustment to multiple p-values for the detection of differentially expressed genes in microarray experiments. In Bioinformatics 2004,  Conferences in Research and Practice in Information Technology - Second Asia-Pacific Bioinformatics Conference, 29,Y-P. P. Chen, Ed., 155-159, Australian Computer Society, Sydney.

 

  1. Warrenfeltz, Z., Pavlik, S., Datta, S., Kraemer, E., Benedict, B. Mcdonald, J. F. (2004).  Gene expression profiling of epithelial ovarian tumors corelated with malignant potential.  Molecular Cancer, 2004, 3:27.

 

  1.  Datta, S.  and Datta, S. (2005). Empirical Bayes screening (EBS) of many p-values with applications to microarray studies, Bioinformatics, 21, 1987-1994. 
  2. Weinberg, M. V., Schut, G. J., Brehm, S., Datta, S., and Adams, M. W. W.  (2005).  A hyperthermoplilic cold shock response: the archaeon Pyrococcus furiosus  synthesizes novel membrane-bound glycoproteins at a sub-optimal growth temperature. Journal of Bacteriology, 187, 336-348.
  3. Datta, S. and de Padilla, L.M. (2006). Feature selection and machine learning with mass spectrometry data for distinguishing cancer and non-cancer samples, Statistical Methodology (Special Issue on Bioinformatics), 3, 79-92.
  4. Datta, S.  and Datta, S. (2006). Validation measures for clustering algorithms incorporating biological information, IEEE Proceedings of International Multi-Symposiums on Computer and Computional Sciences (IMSCCS|06), (J. Ni, J. Dongarra, Y. Zheng, G. Gu, G. Wolfgang and H. Jin, Eds.), 1, 131-135.
  5.  Datta, S.  and Datta, S. (2006). Evaluation of clustering algorithms for gene expression data, BMC Bioinformatics, 7 (Suppl 4): S17.
  6. Datta, S.  and Datta, S. (2006).  Methods for evaluating clustering algorithms for gene expression data using a reference set of functional classes, BMC Bioinformatics, 7, 397.
  7. Boratyn, G. M., Datta, S. and Datta, S. (2006). Biologically supervised hierarchical clustering algorithms for gene expression data, Proceedings of the 28th IEEE  EMBS Annual International Conference, New York City, USA, 5515-5518.
  8. Datta, S., Le-Rademacher, J. and Datta, S. (2007). Predicting patient survival from microarray data by accelerated failure time modeling using partial least squares and LASSO, Biometrics, 63, 259-271.
  9. Datta, S., Datta, S., Parrish, R. S. and Thompson, C. M. (2007). Microarray data analysis. In Computational Methods in Biomedical Research, R. Khatree and D. Naik, eds., Chapman & Hall/CRC Biostatistics Series, Volume 24, 1-43.
  10. Boratyn, G. M., Datta, S. and Datta, S. (2007). Incorporation of biological knowledge into distance for clustering genes. Bioinformation, 1, 396-405.
  11. Pihur, V., Datta, S. and Datta, S. (2007). Weighted rank aggregation of cluster validation measures: A Monte Carlo cross-entropy approach. Bioinformatics,  23, 1607-1615.
  1. Pihur, V., Datta, S. and Datta, S. (2008). Finding cancer genes through meta-analysis of microarray experiments: Rank aggregation via the cross entropy algorithm. Genomics, to appear.  doi:10.1016/j.ygeno.2008.05.003  
  2. Pihur, V., Datta, S. and Datta, S. (2007). Understanding Chronic Fatigue Syndrome (CFS) from CAMDA data: A systems biology approach. Proceedings of CAMDA 2007, full paper, online @ http://camda.bioinfo.cipf.es/camda07/agenda/detailed.html.
  3.  Pihur, V., Brock, G., Datta, S. and Datta, S. (2008). Cluster validation for microarray data: An appraisal. In Multivariate Statistical Methods, ( A. SenGupta, ed), ISI Platinum Jubilee series, Vol 5, World Scientific Press, to appear (2008).
  4. Brock, G., Pihur, V., Datta, S. and Datta, S. (2008). clValid , an R package for cluster validation. Journal of Statistical Software, 25, 4.
  5. Pihur, V., Datta, S. and Datta, S. (2008). Reconstruction of genetic association networks from microarray data: A partial least squares approach. Bioinformatics,  24, 561-568.
  6.  Datta, S., Turner, D., Singh, R., Ruset, B., Pierce, W. M.,  and Knudsen, T. B. (2008). Fetal alcohol syndrome in mice detected through proteomics screening of the amniotic fluid. Birth Defects Research Part A: Clinical and Molecular Teratology, 82, 177-186.
  7. Datta, S. and Pihur, V. (2008). Feature selection and machine learning with mass spectrometry data, R. Matthiesen, ed., In Clinical Proteomics: Methods, Applications and Tools, Humana Press, to appear.

     Other Publications:

  1. Datta, S. (1999). Hypotheses testing for different selection models using multi-generation cytonuclear data, Proceedings of American Statistical Association, Biometrics Section, 157-161, Alexandria, USA.
  2. Datta, S. (2000). Book Review: Statistics in Human Genetics by Pak Sham. Statistics in Medicine, 19,1384-1385.
  3. Datta, S. (2005). Statistics in Genetics, In Encyclopedia of Statistical Sciences, Second edition, Wiley, New York.
  4. Datta, S. (2005). Statistics in Microarray Analysis, In Encyclopedia of Statistical Sciences, Second edition, Wiley, New York.
  5. Datta, S. (2005). Statistics in Vaccine Studies, In Encyclopedia of Statistical Sciences, Second edition, Wiley, New York.
  6. Datta, S. and Datta, S. (2006). Validation of statistical clustering using biological information, Proceedings of INTERFACE 2005 (CD-ROM).

 


Grants:

 

 


 

Honors/Awards/Press:

 

 


 

Presentations:  

 

Invited Talks at Professional/Research Meetings: