TY - BOOK
UR - http://lib.ugent.be/catalog/ebk01:3710000000926148
ID - ebk01:3710000000926148
ET - 2nd ed. 2016.
LA - eng
TI - Statistical Learning from a Regression Perspective
PY - 2016
SN - 9783319440484
AU - Berk, Richard A. author.
AB - Statistical Learning as a Regression Problem -- Splines, Smoothers, and Kernels -- Classification and Regression Trees (CART) -- Bagging -- Random Forests -- Boosting -- Support Vector Machines -- Some Other Procedures Briefly -- Broader Implications and a Bit of Craft Lore.
AB - This textbook considers statistical learning applications when interest centers on the conditional distribution of the response variable, given a set of predictors, and when it is important to characterize how the predictors are related to the response. As a first approximation, this can be seen as an extension of nonparametric regression. This fully revised new edition includes important developments over the past 8 years. Consistent with modern data analytics, it emphasizes that a proper statistical learning data analysis derives from sound data collection, intelligent data management, appropriate statistical procedures, and an accessible interpretation of results. A continued emphasis on the implications for practice runs through the text. Among the statistical learning procedures examined are bagging, random forests, boosting, support vector machines and neural networks. Response variables may be quantitative or categorical. As in the first edition, a unifying theme is supervised learning that can be treated as a form of regression analysis. Key concepts and procedures are illustrated with real applications, especially those with practical implications. A principal instance is the need to explicitly take into account asymmetric costs in the fitting process. For example, in some situations false positives may be far less costly than false negatives. Also provided is helpful craft lore such as not automatically ceding data analysis decisions to a fitting algorithm. In many settings, subject-matter knowledge should trump formal fitting criteria. Yet another important message is to appreciate the limitation of one’s data and not apply statistical learning procedures that require more than the data can provide. The material is written for upper undergraduate level and graduate students in the social and life sciences and for researchers who want to apply statistical learning procedures to scientific and policy problems. The author uses this book in a course on modern regression for the social, behavioral, and biological sciences. Intuitive explanations and visual representations are prominent. All of the analyses included are done in R with code routinely provided.
ER -
04387nam a22005415i 4500 | |||
001 | 978-3-319-44048-4 | ||
003 | DE-He213 | ||
005 | 20161027120457.0 | ||
007 | cr nn 008mamaa | ||
008 | 161027s2016 gw | s |||| 0|eng d | ||
020 | a 9783319440484 9 978-3-319-44048-4 | ||
024 | 7 | a 10.1007/978-3-319-44048-4 2 doi | |
050 | 4 | a QA276-280 | |
072 | 7 | a PBT 2 bicssc | |
072 | 7 | a MAT029000 2 bisacsh | |
082 | 4 | a 519.5 2 23 | |
100 | 1 | a Berk, Richard A. e author. | |
245 | 1 | a Statistical Learning from a Regression Perspective h [electronic resource] / c by Richard A. Berk. | |
250 | a 2nd ed. 2016. | ||
264 | 1 | a Cham : b Springer International Publishing : b Imprint: Springer, c 2016. | |
300 | a XXIII, 347 p. 120 illus., 91 illus. in color. b online resource. | ||
336 | a text b txt 2 rdacontent | ||
337 | a computer b c 2 rdamedia | ||
338 | a online resource b cr 2 rdacarrier | ||
347 | a text file b PDF 2 rda | ||
490 | 1 | a Springer Texts in Statistics, x 1431-875X | |
505 | a Statistical Learning as a Regression Problem -- Splines, Smoothers, and Kernels -- Classification and Regression Trees (CART) -- Bagging -- Random Forests -- Boosting -- Support Vector Machines -- Some Other Procedures Briefly -- Broader Implications and a Bit of Craft Lore. | ||
520 | a This textbook considers statistical learning applications when interest centers on the conditional distribution of the response variable, given a set of predictors, and when it is important to characterize how the predictors are related to the response. As a first approximation, this can be seen as an extension of nonparametric regression. This fully revised new edition includes important developments over the past 8 years. Consistent with modern data analytics, it emphasizes that a proper statistical learning data analysis derives from sound data collection, intelligent data management, appropriate statistical procedures, and an accessible interpretation of results. A continued emphasis on the implications for practice runs through the text. Among the statistical learning procedures examined are bagging, random forests, boosting, support vector machines and neural networks. Response variables may be quantitative or categorical. As in the first edition, a unifying theme is supervised learning that can be treated as a form of regression analysis. Key concepts and procedures are illustrated with real applications, especially those with practical implications. A principal instance is the need to explicitly take into account asymmetric costs in the fitting process. For example, in some situations false positives may be far less costly than false negatives. Also provided is helpful craft lore such as not automatically ceding data analysis decisions to a fitting algorithm. In many settings, subject-matter knowledge should trump formal fitting criteria. Yet another important message is to appreciate the limitation of one’s data and not apply statistical learning procedures that require more than the data can provide. The material is written for upper undergraduate level and graduate students in the social and life sciences and for researchers who want to apply statistical learning procedures to scientific and policy problems. The author uses this book in a course on modern regression for the social, behavioral, and biological sciences. Intuitive explanations and visual representations are prominent. All of the analyses included are done in R with code routinely provided. | ||
650 | a Statistics. | ||
650 | a Public health. | ||
650 | a Probabilities. | ||
650 | a Social sciences. | ||
650 | a Psychology x Methodology. | ||
650 | a Psychological measurement. | ||
650 | 1 | 4 | a Statistics. |
650 | 2 | 4 | a Statistical Theory and Methods. |
650 | 2 | 4 | a Probability Theory and Stochastic Processes. |
650 | 2 | 4 | a Statistics for Social Science, Behavioral Science, Education, Public Policy, and Law. |
650 | 2 | 4 | a Public Health. |
650 | 2 | 4 | a Psychological Methods/Evaluation. |
650 | 2 | 4 | a Methodology of the Social Sciences. |
710 | 2 | a SpringerLink (Online service) | |
773 | t Springer eBooks | ||
776 | 8 | i Printed edition: z 9783319440477 | |
830 | a Springer Texts in Statistics, x 1431-875X | ||
856 | 4 | u http://dx.doi.org/10.1007/978-3-319-44048-4 | |
912 | a ZDB-2-SMA | ||
950 | a Mathematics and Statistics (Springer-11649) |
All data below are available under the Open Data Commons Open Database License (ODbL). You are free to copy, distribute, and use the database; to produce works from the database; and to modify, transform, and build upon the database, as long as you attribute the data sets to the source, publish your adapted database under the ODbL, and keep the dataset open (do not use technical measures such as DRM to restrict access to the database).
The datasets are also available as weekly exports.