Data-Centric Machine Learning with Python: The ultimate guide to engineering and deploying high-quality models based on good data

Data-Centric Machine Learning with Python: The ultimate guide to engineering and deploying high-quality models based on good data
by: Jonas Christensen (Author),Nakul Bajaj(Author),Manmohan Gosada(Author)&0more
Publisher:Packt Publishing
Publication Date: 29 Feb. 2024
Language:English
Print Length:378 pages
ISBN-10:1804618128
ISBN-13:9781804618127


Book Description
Join the data-centric revolution and master the concepts, techniques, and algorithms shaping the future of AI and ML development, using Python
Key FeaturesGrasp the principles of data centricity and apply them to real-world scenariosGain experience with quality data collection, labeling, and synthetic data creation using PythonDevelop essential skills for building reliable, responsible, and ethical machine learning solutionsPurchase of the print or Kindle book includes a free PDF eBook
Book DescriptionIn the rapidly advancing data-driven world where data quality is pivotal to the success of machine learning and artificial intelligence projects, this critically timed guide provides a rare, end-to-end overview of data-centric machine learning (DCML), along with hands-on applications of technical and non-technical approaches to generating deeper and more accurate datasets.This book will help you understand what data-centric ML/AI is and how it can help you to realize the potential of 'small data'. Delving into the building blocks of data-centric ML/AI, you'll explore the human aspects of data labeling, tackle ambiguity in labeling, and understand the role of synthetic data. From strategies to improve data collection to techniques for refining and augmenting datasets, you'll learn everything you need to elevate your data-centric practices. Through applied examples and insights for overcoming challenges, you'll get a roadmap for implementing data-centric ML/AI in diverse applications in Python.By the end of this book, you'll have developed a profound understanding of data-centric ML/AI and the proficiency to seamlessly integrate common data-centric approaches in the model development lifecycle to unlock the full potential of your machine learning projects by prioritizing data quality and reliability.
What you will learnUnderstand the impact of input data quality compared to model selection and tuningRecognize the crucial role of subject-matter experts in effective model developmentImplement data cleaning, labeling, and augmentation best practicesExplore common synthetic data generation techniques and their applicationsApply synthetic data generation techniques using common Python packagesDetect and mitigate bias in a dataset using best-practice techniquesUnderstand the importance of reliability, responsibility, and ethical considerations in ML/AI
Who this book is forThis book is for data science professionals and machine learning enthusiasts looking to understand the concept of data-centricity, its benefits over a model-centric approach, and the practical application of a best-practice data-centric approach in their work. This book is also for other data professionals and senior leaders who want to explore the tools and techniques to improve data quality and create opportunities for small data ML/AI in their organizations.
Table of contentsExploring Data-Centric Machine LearningFrom Model-Centric to Data-Centric - ML's EvolutionPrinciples of Data-Centric MLData Labeling Is a Collaborative ProcessTechniques for Data CleaningTechniques for Programmatic Labeling in Machine LearningUsing Synthetic Data in Data-Centric Machine LearningTechniques for Identifying and Removing BiasDealing with Edge Cases and Rare Events in Machine LearningKick-Starting Your Journey in Data-Centric Machine Learning

About the Author
Review “I staunchly believe in the power of quality data to deliver outstanding solutions to business problems. The book had me at this: “Data quality is more important than data volume when it comes to building highly informative models.” Kudos to the authors for writing this excellent resource for machine learning practitioners, addressing the challenges encountered in the development of ML models, such as the shortcomings of model-centric development and annotations, data availability, and noisy data.” --Dr. Shantha Mohan, Author and Co-Founder of Retail Solutions
About the Author Jonas Christensen has spent his career leading data science functions across multiple industries. He is an international keynote speaker, postgraduate educator, and advisor in the fields of data science, analytics leadership, and machine learning and host of the Leaders of Analytics podcast.Nakul Bajaj is a data scientist, MLOps engineer, educator and mentor, helping students and junior engineers navigate their data journey. He has a strong passion for MLOps, with a focus on reducing complexity and delivering value from machine learning use-cases in business and healthcare.Manmohan Gosada is a seasoned professional with a proven track record in the dynamic field of data science. With a comprehensive background spanning various data science functions and industries, Manmohan has emerged as a leader in driving innovation and delivering impactful solutions. He has successfully led large-scale data science projects, leveraging cutting-edge technologies to implement transformative products. With a postgraduate degree, he is not only well-versed in the theoretical foundations of data science but is also passionate about sharing insights and knowledge. A captivating speaker, he engages audiences with a blend of expertise and enthusiasm, demystifying complex concepts in the world of data science.

资源下载资源下载价格10立即购买
1111

未经允许不得转载:电子书百科大全 » Data-Centric Machine Learning with Python: The ultimate guide to engineering and deploying high-quality models based on good data

评论 0

评论前必须登录!

登陆 注册