Analysis of Categorical Data with R

Author: Christopher R. Bilder, Thomas M. Loughin

Publisher: CRC Press

ISBN: 1439855676

Category: Mathematics

Page: 547

Learn How to Properly Analyze Categorical Data

Analysis of Categorical Data with R presents a modern account of categorical data analysis using the popular R software. It covers recent techniques of model building and assessment for binary, multicategory, and count response variables and discusses fundamentals, such as odds ratio and probability estimation. The authors give detailed advice and guidelines on which procedures to use and why to use them.

The Use of R as Both a Data Analysis Method and a Learning Tool

Requiring no prior experience with R, the text offers an introduction to the essential features and functions of R. It incorporates numerous examples from medicine, psychology, sports, ecology, and other areas, along with extensive R code and output. The authors use data simulation in R to help readers understand the underlying assumptions of a procedure and then to evaluate the procedure’s performance. They also present many graphical demonstrations of the features and properties of various analysis methods.

Web Resource

The data sets and R programs from each example are available at www.chrisbilder.com/categorical. The programs include code used to create every plot and piece of output. Many of these programs contain code to demonstrate additional features or to perform more detailed analyses than what is in the text. Designed to be used in tandem with the book, the website also uniquely provides videos of the authors teaching a course on the subject. These videos include live, in-class recordings, which instructors may find useful in a blended or flipped classroom setting. The videos are also suitable as a substitute for a short course.

Discrete Data Analysis with R

Visualization and Modeling Techniques for Categorical and Count Data

Author: Michael Friendly, David Meyer

Publisher: CRC Press

ISBN: 1498725856

Category: Mathematics

Page: 544

An Applied Treatment of Modern Graphical Methods for Analyzing Categorical Data

Discrete Data Analysis with R: Visualization and Modeling Techniques for Categorical and Count Data presents an applied treatment of modern methods for the analysis of categorical data, both discrete response data and frequency data. It explains how to use graphical methods for exploring data, spotting unusual features, visualizing fitted models, and presenting results. The book is designed for advanced undergraduate and graduate students in the social and health sciences, epidemiology, economics, business, statistics, and biostatistics as well as researchers, methodologists, and consultants who can use the methods with their own data and analyses. Along with describing the necessary statistical theory, the authors illustrate the practical application of the techniques to a large number of substantive problems, including how to organize data, conduct an analysis, produce informative graphs, and evaluate what the graphs reveal about the data.

The first part of the book contains introductory material on graphical methods for discrete data, basic R skills, and methods for fitting and visualizing one-way discrete distributions. The second part focuses on simple, traditional nonparametric tests and exploratory methods for visualizing patterns of association in two-way and larger frequency tables. The final part of the text discusses model-based methods for the analysis of discrete data.

Web Resource

The data sets and R software used, including the authors’ own vcd and vcdExtra packages, are available at http://cran.r-project.org.

Applied Categorical and Count Data Analysis

Author: Wan Tang, Hua He, Xin M. Tu

Publisher: CRC Press

ISBN: 1439806241

Category: Mathematics

Page: 384

Developed from the authors’ graduate-level biostatistics course, Applied Categorical and Count Data Analysis explains how to perform the statistical analysis of discrete data, including categorical and count outcomes. The authors describe the basic ideas underlying each concept, model, and approach to give readers a good grasp of the fundamentals of the methodology without using rigorous mathematical arguments. The text covers classic concepts and popular topics, such as contingency tables, logistic models, and Poisson regression models, along with modern areas that include models for zero-modified count outcomes, parametric and semiparametric longitudinal data analysis, reliability analysis, and methods for dealing with missing values. R, SAS, SPSS, and Stata programming codes are provided for all the examples, enabling readers to immediately experiment with the data in the examples and even adapt or extend the codes to fit data from their own studies. Designed for a one-semester course for graduate and senior undergraduate students in biostatistics, this self-contained text is also suitable as a self-learning guide for biomedical and psychosocial researchers. It will help readers analyze data with discrete variables in a wide range of biomedical and psychosocial research fields.

Linear Models with R, Second Edition

Author: Julian J. Faraway

Publisher: CRC Press

ISBN: 1439887330

Category: Mathematics

Page: 286

A Hands-On Way to Learning Data Analysis

Part of the core of statistics, linear models are used to make predictions and explain the relationship between the response and the predictors. Understanding linear models is crucial to a broader competence in the practice of statistics. Linear Models with R, Second Edition explains how to use linear models in physical science, engineering, social science, and business applications. The book incorporates several improvements that reflect how the world of R has greatly expanded since the publication of the first edition.

New to the Second Edition
Reorganized material on interpreting linear models, which distinguishes the main applications of prediction and explanation and introduces elementary notions of causality
Additional topics, including QR decomposition, splines, additive models, Lasso, multiple imputation, and false discovery rates
Extensive use of the ggplot2 graphics package in addition to base graphics

Like its widely praised, best-selling predecessor, this edition combines statistics and R to seamlessly give a coherent exposition of the practice of linear modeling. The text offers up-to-date insight on essential data analysis topics, from estimation, inference, and prediction to missing data, factorial models, and block designs. Numerous examples illustrate how to apply the different methods using R.

Applied Survey Data Analysis

Author: Steven G. Heeringa, Brady T. West, Patricia A. Berglund

Publisher: CRC Press

ISBN: 9781420080674

Category: Mathematics

Page: 487

Taking a practical approach that draws on the authors’ extensive teaching, consulting, and research experiences, Applied Survey Data Analysis provides an intermediate-level statistical overview of the analysis of complex sample survey data. It emphasizes methods and worked examples using available software procedures while reinforcing the principles and theory that underlie those methods. After introducing a step-by-step process for approaching a survey analysis problem, the book presents the fundamental features of complex sample designs and shows how to integrate design characteristics into the statistical methods and software for survey estimation and inference. The authors then focus on the methods and models used in analyzing continuous, categorical, and count dependent variables; event history; and missing data problems. Some of the techniques discussed include univariate descriptive and simple bivariate analyses, the linear regression model, generalized linear regression modeling methods, the Cox proportional hazards model, discrete time models, and the multiple imputation analysis method. The final chapter covers new developments in survey applications of advanced statistical techniques, including model-based analysis approaches. Designed for readers working in a wide array of disciplines who use survey data in their work, this book also provides a useful framework for integrating more in-depth studies of the theory and methods of survey data analysis. A guide to the applied statistical analysis and interpretation of survey data, it contains many examples and practical exercises based on major real-world survey data sets. Although the authors use Stata for most examples in the text, they offer SAS, SPSS, SUDAAN, R, WesVar, IVEware, and Mplus software code for replicating the examples on the book’s website: http://www.isr.umich.edu/src/smp/asda/

Understanding Advanced Statistical Methods

Author: Peter Westfall, Kevin S. S. Henning

Publisher: CRC Press

ISBN: 1466512105

Category: Mathematics

Page: 569

Providing a much-needed bridge between elementary statistics courses and advanced research methods courses, Understanding Advanced Statistical Methods helps students grasp the fundamental assumptions and machinery behind sophisticated statistical topics, such as logistic regression, maximum likelihood, bootstrapping, nonparametrics, and Bayesian methods. The book teaches students how to properly model, think critically, and design their own studies to avoid common errors. It leads them to think differently not only about math and statistics but also about general research and the scientific method. With a focus on statistical models as producers of data, the book enables students to more easily understand the machinery of advanced statistics. It also downplays the "population" interpretation of statistical models and presents Bayesian methods before frequentist ones. Requiring no prior calculus experience, the text employs a "just-in-time" approach that introduces mathematical topics, including calculus, where needed. Formulas throughout the text are used to explain why calculus and probability are essential in statistical modeling. The authors also intuitively explain the theory and logic behind real data analysis, incorporating a range of application examples from the social, economic, biological, medical, physical, and engineering sciences. Enabling your students to answer the why behind statistical methods, this text teaches them how to successfully draw conclusions when the premises are flawed. It empowers them to use advanced statistical methods with confidence and develop their own statistical recipes. Ancillary materials are available on the book’s website.

Beyond ANOVA

Basics of Applied Statistics

Author: Rupert G. Miller, Jr.

Publisher: CRC Press

ISBN: 9780412070112

Category: Mathematics

Page: 336

Renowned statistician R.G. Miller set the pace for statistics students with Beyond ANOVA: Basics of Applied Statistics. Designed to show students how to work with a set of "real world data," Miller's text goes beyond any specific discipline and considers a wide variety of techniques, from ANOVA to empirical Bayes methods, the jackknife, bootstrap methods, and the James-Stein estimator. This reissue of Miller's classic book has been revised by professors at Stanford University, California. As before, one of the main strengths of Beyond ANOVA is its promotion of the most straightforward data analysis methods, giving students a viable option instead of resorting to complicated and unnecessary tests. Assuming a basic background in statistics, Beyond ANOVA is written for undergraduate and graduate statistics students. Its approach will also be valued by biologists, social scientists, engineers, and anyone who may wish to handle their own data analysis.

Exploratory Multivariate Analysis by Example Using R

Author: Francois Husson, Sebastien Le, Jérôme Pagès

Publisher: CRC Press

ISBN: 1439835810

Category: Mathematics

Page: 240

Full of real-world case studies and practical advice, Exploratory Multivariate Analysis by Example Using R focuses on four fundamental methods of multivariate exploratory data analysis that are most suitable for applications. It covers principal component analysis (PCA) when variables are quantitative, correspondence analysis (CA) and multiple correspondence analysis (MCA) when variables are categorical, and hierarchical cluster analysis. The authors take a geometric point of view that provides a unified vision for exploring multivariate data tables. Within this framework, they present the principles, indicators, and ways of representing and visualizing objects that are common to the exploratory methods. The authors show how to use categorical variables in a PCA context in which variables are quantitative, how to handle more than two categorical variables in a CA context in which there are originally two variables, and how to add quantitative variables in an MCA context in which variables are categorical. They also illustrate the methods and the ways they can be exploited using examples from various fields. Throughout the text, each result correlates with an R command accessible in the FactoMineR package developed by the authors. All of the data sets and code are available at http://factominer.free.fr/book. By using the theory, examples, and software presented in this book, readers will be fully equipped to tackle real-life multivariate data.

Bayesian Ideas and Data Analysis

An Introduction for Scientists and Statisticians

Author: Ronald Christensen, Wesley Johnson, Adam Branscum, Timothy E Hanson

Publisher: CRC Press

ISBN: 1439803552

Category: Mathematics

Page: 516

Emphasizing the use of WinBUGS and R to analyze real data, Bayesian Ideas and Data Analysis: An Introduction for Scientists and Statisticians presents statistical tools to address scientific questions. It highlights foundational issues in statistics, the importance of making accurate predictions, and the need for scientists and statisticians to collaborate in analyzing data. The WinBUGS code provided offers a convenient platform to model and analyze a wide range of data. The first five chapters of the book contain core material that spans basic Bayesian ideas, calculations, and inference, including modeling one and two sample data from traditional sampling models. The text then covers Monte Carlo methods, such as Markov chain Monte Carlo (MCMC) simulation. After discussing linear structures in regression, it presents binomial regression, normal regression, analysis of variance, and Poisson regression, before extending these methods to handle correlated data. The authors also examine survival analysis and binary diagnostic testing. A complementary chapter on diagnostic testing for continuous outcomes is available on the book’s website. The last chapter on nonparametric inference explores density estimation and flexible regression modeling of mean functions. The appropriate statistical analysis of data involves a collaborative effort between scientists and statisticians. Exemplifying this approach, Bayesian Ideas and Data Analysis focuses on the necessary tools and concepts for modeling and analyzing scientific data. Data sets and codes are provided on a supplemental website.

Analysis of Incomplete Multivariate Data

Author: J.L. Schafer

Publisher: CRC Press

ISBN: 9781439821862

Category: Mathematics

Page: 448

The last two decades have seen enormous developments in statistical methods for incomplete data. The EM algorithm and its extensions, multiple imputation, and Markov Chain Monte Carlo provide a set of flexible and reliable tools for inference in large classes of missing-data problems. Yet, in practical terms, those developments have had surprisingly little impact on the way most data analysts handle missing values on a routine basis. Analysis of Incomplete Multivariate Data helps bridge the gap between theory and practice, making these missing-data tools accessible to a broad audience. It presents a unified, Bayesian approach to the analysis of incomplete multivariate data, covering datasets in which the variables are continuous, categorical, or both. The focus is applied, where necessary, to help readers thoroughly understand the statistical properties of those methods, and the behavior of the accompanying algorithms. All techniques are illustrated with real data examples, with extended discussion and practical advice. All of the algorithms described in this book have been implemented by the author for general use in the statistical languages S and S-PLUS. The software is available free of charge on the Internet.

A Handbook of Statistical Analyses using R, Third Edition

Author: Torsten Hothorn, Brian S. Everitt

Publisher: CRC Press

ISBN: 1482204584

Category: Mathematics

Page: 456

Like the best-selling first two editions, A Handbook of Statistical Analyses using R, Third Edition provides an up-to-date guide to data analysis using the R system for statistical computing. The book explains how to conduct a range of statistical analyses, from simple inference to recursive partitioning to cluster analysis.

New to the Third Edition
Three new chapters on quantile regression, missing values, and Bayesian inference
Extra material in the logistic regression chapter that describes a regression model for ordered categorical response variables
Additional exercises
More detailed explanations of R code
New section in each chapter summarizing the results of the analyses
Updated version of the HSAUR package (HSAUR3), which includes some slides that can be used in introductory statistics courses

Whether you’re a data analyst, scientist, or student, this handbook shows you how to easily use R to effectively evaluate your data. With numerous real-world examples, it emphasizes the practical application and interpretation of results.

Statistical Analysis of Questionnaires

A Unified Approach Based on R and Stata

Author: Francesco Bartolucci, Silvia Bacci, Michela Gnaldi

Publisher: CRC Press

ISBN: 146656850X

Category: Mathematics

Page: 328

Statistical Analysis of Questionnaires: A Unified Approach Based on R and Stata presents special statistical methods for analyzing data collected by questionnaires. The book takes an applied approach to testing and measurement tasks, mirroring the growing use of statistical methods and software in education, psychology, sociology, and other fields. It is suitable for graduate students in applied statistics and psychometrics and practitioners in education, health, and marketing. The book covers the foundations of classical test theory (CTT), test reliability, validity, and scaling as well as item response theory (IRT) fundamentals and IRT for dichotomous and polytomous items. The authors explore the latest IRT extensions, such as IRT models with covariates, multidimensional IRT models, IRT models for hierarchical and longitudinal data, and latent class IRT models. They also describe estimation methods and diagnostics, including graphical diagnostic tools, parametric and nonparametric tests, and differential item functioning. Stata and R software codes are included for each method. To enhance comprehension, the book employs real datasets in the examples and illustrates the software outputs in detail. The datasets are available on the authors’ web page.

Exploratory Data Analysis with MATLAB, Third Edition

Author: Wendy L. Martinez, Angel R. Martinez, Jeffrey Solka

Publisher: CRC Press

ISBN: 1315349841

Category: Mathematics

Page: 590

Praise for the Second Edition:

"The authors present an intuitive and easy-to-read book. ... accompanied by many examples, proposed exercises, good references, and comprehensive appendices that initiate the reader unfamiliar with MATLAB." --Adolfo Alvarez Pinto, International Statistical Review

"Practitioners of EDA who use MATLAB will want a copy of this book. ... The authors have done a great service by bringing together so many EDA routines, but their main accomplishment in this dynamic text is providing the understanding and tools to do EDA." --David A Huckaby, MAA Reviews

Exploratory Data Analysis (EDA) is an important part of the data analysis process. The methods presented in this text are ones that should be in the toolkit of every data scientist. As computational sophistication has increased and data sets have grown in size and complexity, EDA has become an even more important process for visualizing and summarizing data before making assumptions to generate hypotheses and models. Exploratory Data Analysis with MATLAB, Third Edition presents EDA methods from a computational perspective and uses numerous examples and applications to show how the methods are used in practice. The authors use MATLAB code, pseudo-code, and algorithm descriptions to illustrate the concepts. The MATLAB code for examples, data sets, and the EDA Toolbox are available for download on the book’s website.

New to the Third Edition
Random projections and estimating local intrinsic dimensionality
Deep learning autoencoders and stochastic neighbor embedding
Minimum spanning tree and additional cluster validity indices
Kernel density estimation
Plots for visualizing data distributions, such as beanplots and violin plots
A chapter on visualizing categorical data

Analysis of Ordinal Categorical Data

Author: Alan Agresti

Publisher: John Wiley & Sons

ISBN: 1118209990

Category: Mathematics

Page: 424

Statistical science’s first coordinated manual of methods for analyzing ordered categorical data, now fully revised and updated, continues to present applications and case studies in fields as diverse as sociology, public health, ecology, marketing, and pharmacy. Analysis of Ordinal Categorical Data, Second Edition provides an introduction to basic descriptive and inferential methods for categorical data, giving thorough coverage of new developments and recent methods. Special emphasis is placed on interpretation and application of methods including an integrated comparison of the available strategies for analyzing ordinal data. Practitioners of statistics in government, industry (particularly pharmaceutical), and academia will want this new edition.

The R Book

Author: Michael J. Crawley

Publisher: John Wiley & Sons

ISBN: 1118448960

Category: Mathematics

Page: 1080

Hugely successful and popular text presenting an extensive and comprehensive guide for all R users

The R language is recognized as one of the most powerful and flexible statistical software packages, enabling users to apply many statistical techniques, including to large data sets, that would be impossible without such software. R has become an essential tool for understanding and carrying out research.

This edition:
Features full colour text and extensive graphics throughout.
Introduces a clear structure with numbered section headings to help readers locate information more efficiently.
Looks at the evolution of R over the past five years.
Features a new chapter on Bayesian Analysis and Meta-Analysis.
Presents a fully revised and updated bibliography and reference section.
Is supported by an accompanying website allowing examples from the text to be run by the user.

Praise for the first edition:

‘…if you are an R user or wannabe R user, this text is the one that should be on your shelf. The breadth of topics covered is unsurpassed when it comes to texts on data analysis in R.’ (The American Statistician, August 2008)

‘The High-level software language of R is setting standards in quantitative analysis. And now anybody can get to grips with it thanks to The R Book…’ (Professional Pensions, July 2007)

Introduction to Statistical Data Analysis for the Life Sciences

Author: Claus Thorn Ekstrom, Helle Sørensen

Publisher: CRC Press

ISBN: 1439825556

Category: Mathematics

Page: 428

Any practical introduction to statistics in the life sciences requires a focus on applications and computational statistics combined with a reasonable level of mathematical rigor. It must offer the right combination of data examples, statistical theory, and computing required for analysis today. And it should involve R software, the lingua franca of statistical computing. Introduction to Statistical Data Analysis for the Life Sciences covers all the usual material but goes further than other texts to emphasize:
Both data analysis and the mathematics underlying classical statistical analysis
Modeling aspects of statistical analysis with added focus on biological interpretations
Applications of statistical software in analyzing real-world problems and data sets

Drawing on their courses at the University of Copenhagen, the authors give readers the ability to model and analyze data early in the text and then gradually fill in the blanks with the needed probability and statistics theory. While the main text can be used with any statistical software, the authors encourage a reliance on R. They provide a short tutorial for those new to the software and include R commands and output at the end of each chapter. Data sets used in the book are available on a supporting website. Each chapter contains a number of exercises, half of which can be done by hand. The text also contains ten case exercises where readers are encouraged to apply their knowledge to larger data sets and learn more about approaches specific to the life sciences. Ultimately, readers come away with a computational toolbox that enables them to perform actual analysis for real data sets as well as the confidence and skills to undertake more sophisticated analyses as their careers progress.

Modelling Survival Data in Medical Research, Second Edition

Author: David Collett

Publisher: CRC Press

ISBN: 1584883251

Category: Mathematics

Page: 410

Critically acclaimed and resoundingly popular in its first edition, Modelling Survival Data in Medical Research has been thoroughly revised and updated to reflect the many developments and advances--particularly in software--made in the field over the last 10 years. Now, more than ever, it provides an outstanding text for upper-level and graduate courses in survival analysis, biostatistics, and time-to-event analysis. The treatment begins with an introduction to survival analysis and a description of four studies that lead to survival data. Subsequent chapters then use those data sets and others to illustrate the various analytical techniques applicable to such data, including the Cox regression model, the Weibull proportional hazards model, and others. This edition features a more detailed treatment of topics such as parametric models, accelerated failure time models, and analysis of interval-censored data. The author also focuses the software section on the use of SAS, summarising the methods used by the software to generate its output and examining that output in detail. Profusely illustrated with examples and written in the author's trademark, easy-to-follow style, Modelling Survival Data in Medical Research, Second Edition is a thorough, practical guide to survival analysis that reflects current statistical practices.

Probability and Statistics with R

Author: Maria Dolores Ugarte, Ana F. Militino, Alan T. Arnholt

Publisher: CRC Press

ISBN: 158488892X

Category: Mathematics

Page: 726

Designed for an intermediate undergraduate course, Probability and Statistics with R shows students how to solve various statistical problems using both parametric and nonparametric techniques via the open source software R. It provides numerous real-world examples, carefully explained proofs, end-of-chapter problems, and illuminating graphs to facilitate hands-on learning. Integrating theory with practice, the text briefly introduces the syntax, structures, and functions of the S language, before covering important graphically and numerically descriptive methods. The next several chapters elucidate probability and random variables topics, including univariate and multivariate distributions. After exploring sampling distributions, the authors discuss point estimation, confidence intervals, hypothesis testing, and a wide range of nonparametric methods. With a focus on experimental design, the book also presents fixed- and random-effects models as well as randomized block and two-factor factorial designs. The final chapter describes simple and multiple regression analyses. Demonstrating that R can be used as a powerful teaching aid, this comprehensive text presents extensive treatments of data analysis using parametric and nonparametric techniques. It effectively links statistical concepts with R procedures, enabling the application of the language to the vast world of statistics.

Modern Data Science with R

Author: Benjamin S. Baumer, Daniel T. Kaplan, Nicholas J. Horton

Publisher: CRC Press

ISBN: 1498724493

Category: Law

Page: 556

Modern Data Science with R is a comprehensive data science textbook for undergraduates that incorporates statistical and computational thinking to solve real-world problems with data. Rather than focus exclusively on case studies or programming syntax, this book illustrates how statistical programming in the state-of-the-art R/RStudio computing environment can be leveraged to extract meaningful information from a variety of data in the service of addressing compelling statistical questions. Contemporary data science requires a tight integration of knowledge from statistics, computer science, mathematics, and a domain of application. This book will help readers with some background in statistics and modest prior experience with coding develop and practice the appropriate skills to tackle complex data science projects. The book features a number of exercises and has a flexible organization conducive to teaching a variety of semester courses.

Exact Analysis of Discrete Data

Author: Karim F. Hirji

Publisher: CRC Press

ISBN: 9781420036190

Category: Mathematics

Page: 552

Researchers in fields ranging from biology and medicine to the social sciences, law, and economics regularly encounter variables that are discrete or categorical in nature. While there is no dearth of books on the analysis and interpretation of such data, these generally focus on large sample methods. When sample sizes are not large or the data are otherwise sparse, exact methods--methods not based on asymptotic theory--are more accurate and therefore preferable. This book introduces the statistical theory, analysis methods, and computation techniques for exact analysis of discrete data. After reviewing the relevant discrete distributions, the author develops the exact methods from the ground up in a conceptually integrated manner. The topics covered include univariate discrete data analysis, single and multiple 2 x 2 tables, single and multiple 2 x K tables, incidence density and inverse sampling designs, unmatched and matched case-control studies, paired binary and trinomial response models, and Markov chain data. While most chapters focus on statistical theory and applications, three chapters deal exclusively with computational issues. Detailed worked examples appear throughout the book, and each chapter includes an extensive problem set. Written at an elementary to intermediate level, Exact Analysis of Discrete Data is accessible to anyone having taken a basic course in statistics or biostatistics, bringing to them valuable material previously buried in specialized journals.