Library Hours
Monday to Friday: 9 a.m. to 9 p.m.
Saturday: 9 a.m. to 5 p.m.
Sunday: 1 p.m. to 9 p.m.
Naper Blvd. 1 p.m. to 5 p.m.
     
Limit search to available items
Record 16 of 154
Results Page:  Previous Next
Author Mukhopadhyay, Sayan, author.

Title Advanced data analytics using Python : with architectural patterns, text and image classification, and optimization techniques / Sayan Mukhopadhyay, Pratip Samanta. [O'Reilly electronic resource]

Edition Second edition.
Publication Info. New York, NY : Apress, [2023]
QR Code
Description 1 online resource (259 pages) : illustrations
Note Includes index.
Summary Understand advanced data analytics concepts such as time series and principal component analysis with ETL, supervised learning, and PySpark using Python. This book covers architectural patterns in data analytics, text and image classification, optimization techniques, natural language processing, and computer vision in the cloud environment. Generic design patterns in Python programming is clearly explained, emphasizing architectural practices such as hot potato anti-patterns. You'll review recent advances in databases such as Neo4j, Elasticsearch, and MongoDB. You'll then study feature engineering in images and texts with implementing business logic and see how to build machine learning and deep learning models using transfer learning. Advanced Analytics with Python, 2nd edition features a chapter on clustering with a neural network, regularization techniques, and algorithmic design patterns in data analytics with reinforcement learning. Finally, the recommender system in PySpark explains how to optimize models for a specific application.
Contents Intro -- Table of Contents -- About the Authors -- About the Technical Reviewer -- Acknowledgments -- Introduction -- Chapter 1: A Birds Eye View to AI System -- OOP in Python -- Calling Other Languages in Python -- Exposing the Python Model as a Microservice -- High-Performance API and Concurrent Programming -- Choosing the Right Database -- Summary -- Chapter 2: ETL with Python -- MySQL -- How to Install MySQLdb? -- Database Connection -- INSERT Operation -- READ Operation -- DELETE Operation -- UPDATE Operation -- COMMIT Operation -- ROLL-BACK Operation -- Normal Forms
First Normal Form -- Second Normal Form -- Third Normal Form -- Elasticsearch -- Connection Layer API -- Neo4j Python Driver -- neo4j-rest-client -- In-Memory Database -- MongoDB (Python Edition) -- Import Data into the Collection -- Create a Connection Using pymongo -- Access Database Objects -- Insert Data -- Update Data -- Remove Data -- Cloud Databases -- Pandas -- ETL with Python (Unstructured Data) -- Email Parsing -- Topical Crawling -- Crawling Algorithms -- Summary -- Chapter 3: Feature Engineering and Supervised Learning -- Dimensionality Reduction with Python -- Correlation Analysis
Principal Component Analysis -- Mutual Information -- Classifications with Python -- Semi-Supervised Learning -- Decision Tree -- Which Attribute Comes First? -- Random Forest Classifier -- Naïve Bayes Classifier -- Support Vector Machine -- Nearest Neighbor Classifier -- Sentiment Analysis -- Image Recognition -- Regression with Python -- Least Square Estimation -- Logistic Regression -- Classification and Regression -- Intentionally Bias the Model to Over-Fit or Under-Fit -- Dealing with Categorical Data -- Summary -- Chapter 4: Unsupervised Learning: Clustering -- K-Means Clustering
Choosing K: The Elbow Method -- Silhouette Analysis -- Distance or Similarity Measure -- Properties -- General and Euclidean Distance -- Squared Euclidean Distance -- Distance Between String-Edit Distance -- Levenshtein Distance -- Needleman-Wunsch Algorithm -- Similarity in the Context of a Document -- Types of Similarity -- Example of K-Means in Images -- Preparing the Cluster -- Thresholding -- Time to Cluster -- Revealing the Current Cluster -- Hierarchical Clustering -- Bottom-Up Approach -- Distance Between Clusters -- Single Linkage Method -- Complete Linkage Method
Subject Python (Computer program language)
Machine learning.
Data mining.
Python (Langage de programmation)
Apprentissage automatique.
Exploration de données (Informatique)
Data mining
Machine learning
Python (Computer program language)
Added Author Samanta, Pratip, author.
Other Form: Original 1484280040 9781484280041 (OCoLC)1288664670
ISBN 9781484280058 electronic book
1484280059 electronic book
Standard No. 10.1007/978-1-4842-8005-8 doi
Patron reviews: add a review
Click for more information
EBOOK
No one has rated this material

You can...
Also...
- Find similar reads
- Add a review
- Sign-up for Newsletter
- Suggest a purchase
- Can't find what you want?
More Information