Fundamentals of Data Science: Theory and Practice
Author:: Jugal K. Kalita (Author), Dhruba K. Bhattacharyya (Author), Swarup Roy (Author)
Publisher finelybook 出版社: Academic Press
Edition 版次: 1st
Publication Date 出版日期: 2023-12-29
Language 语言: English
Print Length 页数: 334 pages
ISBN-10: 032391778X
ISBN-13: 9780323917780
Book Description
By finelybook
Fundamentals of Data Science: Theory and Practice presents basic and advanced concepts in data science along with real-life applications. The book provides students, researchers and professionals at different levels a good understanding of the concepts of data science, machine learning, data mining and analytics. Users will find the authors’ research experiences and achievements in data science applications, along with in-depth discussions on topics that are essential for data science projects, including pre-processing, that is carried out before applying predictive and descriptive data analysis tasks and proximity measures for numeric, categorical and mixed-type data.
The book’s authors include a systematic presentation of many predictive and descriptive learning algorithms, including recent developments that have successfully handled large datasets with high accuracy. In addition, a number of descriptive learning tasks are included.
Presents the foundational concepts of data science along with advanced concepts and real-life applications for applied learning
Includes coverage of a number of key topics such as data quality and pre-processing, proximity and validation, predictive data science, descriptive data science, ensemble learning, association rule mining, Big Data analytics, as well as incremental and distributed learning
Provides updates on key applications of data science techniques in areas such as Computational Biology, Network Intrusion Detection, Natural Language Processing, Software Clone Detection, Financial Data Analysis, and Scientific Time Series Data Analysis
Covers computer program code for implementing descriptive and predictive algorithms
Review
Presents the foundational concepts of data science through real-world examples and applications
From the Back Cover
Fundamentals of Data Science: Theory and Practicepresents basic and advanced concepts in data science along with real-life applications. The book provides students, researchers, and professionals at different levels a good understanding of the concepts of data science, machine learning, data mining, and analytics. Data science is an evolving area of study that is extensively used in solving real-life problems. It is not just about machine learning, statistics, or databases. Instead, it is a comprehensive study of a number of topics that help extract novel knowledge from data, starting with preparing the data, applying suitable intelligent learning models, and interpreting the outcome. The models applied are not “one-size-fits-all” and vary with the nature of the data and the applications under consideration. The authors provide discussions of theoretical as well as practical approaches in data science, with a goal to produce a solid understanding of data science which ultimately leads to novel knowledge discovery. Fundamentals of Data Science: Theory and Practice presents the authors’ research experiences and achievements in data science applications. The approach of this book is distinct because of the following clearly enumerated characteristics: The book containsan in-depth discussion on topics that are essential for data science projects, including pre-processing, carried out before applying predictive and descriptive data analysis tasks, and proximity measures for numeric, categorical and mixed-type data, without the knowledge of which it is impossible to develop learning algorithms that apply to a wide range of domains and applications.
The authors include a systematic presentation of many predictive and descriptive learning algorithms, including recent developments that have successfully handled large datasets with high accuracy. In addition, the authors present a number of descriptive learning tasks, including a dedicated chapter on predictive learning (or mining), as well as a wide range of applications,featuring Big Data mining as one of the emphasized topics. The authors discuss the strength and limitations of a number of methods for Big Data miningand also delve in-depth into ensemble learning techniques and analyze their pros and cons.
About the Author
Dr. Jugal Kalita received his BTech degree from the Indian Institute of Technology in Kharagpur, India, his MS degree from the University of Saskatchewan, Canada, and his MS and PhD degrees from the University of Pennsylvania. He is a Professor of Computer Science at the University of Colorado at Colorado Springs. His research interests include machine learning and its applications to areas such as natural language processing, intrusion detection, and bioinformatics. He is the author of more than 250 research articles in reputed conferences and journals and has authored four books, including Network Traffic Anomaly Detection and Prevention from Springer, Gene Expression Data Analysis: A Statistical and Machine Learning Perspective from Chapman and Hall/CRC Press, and Recent Developments in Machine Learning and Data Analytics from Springer. He has received multiple National Science Foundation (NSF) grants Dr. Dhruba K. Bhattacharyya received his PhD in Computer Science and Engineering from Tezpur University. Currently, he is a Senior Professor in the Department of Computer Science & Engineering, Tezpur University, and also the Dean of Academic Affairs. Dr. Bhattacharyya’s major research interests are Machine Learning, Cyber Security, and Bioinformatics, and in all these three fields his contributions are significant. Dr. Bhattacharyya has published more than 260 research articles in peer-reviewed international journals and selective conference proceedings. Dr. Bhattacharyya has authored/edited 18 reference books on machine learning and its applications, including Network Traffic Anomaly Detection and Prevention from Springer, Gene Expression Data Analysis: A Statistical and Machine Learning Perspective from Chapman and Hall/CRC Press, Data Mining Techniques and Its Application in Medical Imagery from VDM, and Clustering Techniques in Spatial Data Analysis from Lambert Academic Publishing. Dr. Bhattacharyya is on the review panel for most major research grants reviewed by the Department of Science & Technology, Government of India, and several international funding agencies. Machine learning research at Tezpur University, led by Dr. Bhattacharyya, has been recognized by the Indian Ministry of Education as a Centre of Excellence, one among twenty in India. Dr. Bhattacharyya is a fellow of the Institution of Electronics and Telecommunication Engineers, and Institution of Engineers, and is a Senior Member of IEEE. He is also on the Editorial/Advisory Boards of several international journals, conferences, and workshops Swarup Roy is a Professor in Computer Science at Sikkim (Central) University, Gangtok. He received his M.Tech. and PhD (Comp. Sc. & Engg.) from Tezpur (Central) University. He worked as a Post-Doctoral Fellow (PDF) at University of Colorado at Colorado Springs, USA and Indian Institute of Technology (IIT), Guwahati. His research interest includes Machine Learning, Data Science, Network Science, Intrusion Detection and Computational Biology. He has published 80+ research articles in high impact international journals and leading world conferences across the globe in machine learning and bioinformatics. He authored the book “Biological Network Analysis- Trends, Approaches, Graphical Theory and Algorithms published by Elsevier, USA . He was a recipient of Best Doctoral Thesis Award from IIT-Roorkee and University Gold Medal. He was selected for Overseas Research Associate Fellowship from DBT, Govt. of India in 2015 to conduct research in the foreign laboratories and funding from DST-SERB to visit SPAIN in 2012 to present his research paper. He taught undergraduate and graduate students of computer science at University of Colorado, USA as visiting professor. He has been listed as a data science subject expert by the Department of Science & Technology-Govt of India. He acted as Track Co-Chair for Biological Modelling at 8th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics (ACM BCB) 2017, Boston, USA. He is acting as Guest Editor of International Journals such as MDPI Data, MDPI Life, Frontier in Bioinformatics and in the technical committee of many reputed International Journals.