Library Hours
Monday to Friday: 9 a.m. to 9 p.m.
Saturday: 9 a.m. to 5 p.m.
Sunday: 1 p.m. to 9 p.m.
Naper Blvd. 1 p.m. to 5 p.m.
     
Limit search to available items
Results Page:  Previous Next
Author Meadows, Alex, author.

Title Pentaho data integration cookbook : over 100 recipes for building open source ETL solutions with Pentaho data integration / Alex Meadows, Adrián Sergio Pulvirenti, María Carina Roldán. [O'Reilly electronic resource]

Edition Second edition.
Publication Info. Birmingham : Packt Publishing, 2013.
QR Code
Description 1 online resource (462 pages)
text file
Bibliography Includes bibliographical references and index.
Contents Cover -- Copyright -- Credits -- About the Author -- About the Reviewers -- www.PacktPub.com -- Table of Contents -- Preface -- Chapter 1: Working with Databases -- Introduction -- Connecting to a database -- Getting data from a database -- Getting data from a database by providing parameters -- Getting data from a database by running a query built at runtime -- Inserting or updating rows in a table -- Inserting new rows where a simple primary key has to be generated -- Inserting new rows where the primary key has to be generated based on stored values
Deleting data from a tableCreating or altering a database table from PDI (design time) -- Creating or altering a database table from PDI (runtime) -- Inserting, deleting, or updating a table depending on a field -- Changing the database connection at runtime -- Loading a parent-child table -- Building SQL queries via database metadata -- Performing repetitive database design tasks from PDI -- Chapter 2: Reading and Writing Files -- Introduction -- Reading a simple file -- Reading several files at the same time -- Reading semi-structured files
Reading files having one field per rowReading files with some fields occupying two or more rows -- Writing a simple file -- Writing a semi-structured file -- Providing the name of a file (for reading or writing) dynamically -- Using the name of a file (or part of it) as a field -- Reading an Excel file -- Getting the value of specific cells in an Excel file -- Writing an Excel file with several sheets -- Writing an Excel file with a dynamic number of sheets -- Reading data from an AWS S3 Instance -- Chapter 3: Working with Big Data and Cloud Sources
IntroductionLoading data into Salesforce.com -- Getting data from Salesforce.com -- Loading data into Hadoop -- Getting data from Hadoop -- Loading data into HBase -- Getting data from HBase -- Loading data into MongoDB -- Getting data from MongoDB -- Chapter 4: Manipulating XML Structures -- Introduction -- Reading simple XML files -- Specifying fields by using Path notation -- Validating well-formed XML files -- Validating an XML file against DTD definitions -- Validating an XML file against an XSD schema -- Generating a simple XML document
Generating complex XML structuresGenerating an HTML page using XML and XSL transformations -- Reading an RSS Feed -- Generating an RSS Feed -- Chapter 5: File Management -- Introduction -- Copying or moving one or more files -- Deleting one or more files -- Getting files from a remote server -- Putting files on a remote server -- Copying or moving a custom list of files -- Deleting a custom list of files -- Comparing files and folders -- Working with ZIP files -- Encrypting and decrypting files -- Chapter 6: Looking for Data -- Introduction
Summary Pentaho Data Integration Cookbook Second Edition is written in a cookbook format, presenting examples in the style of recipes. This allows you to go directly to your topic of interest, or follow topics throughout a chapter to gain a thorough in-depth knowledge. Pentaho Data Integration Cookbook Second Edition is designed for developers who are familiar with the basics of Kettle but who wish to move up to the next level. It is also aimed at advanced users that want to learn how to use the new features of PDI as well as and best practices for working with Kettle.
Language English.
Subject Data warehousing.
Database management -- Computer programs.
Open source software.
Data structures (Computer science)
Data integration (Computer science)
Database Management Systems
Entrepôts de données (Informatique)
Bases de données -- Gestion -- Logiciels.
Logiciels libres.
Structures de données (Informatique)
Intégration de données (Informatique)
Data integration (Computer science)
Data structures (Computer science)
Data warehousing
Database management -- Computer programs
Open source software
Added Author Pulvirenti, Adrián Sergio, author.
Roldá, María Carina, author.
Other Form: 1783280670
130618052X
ISBN 9781783280681 (electronic bk.)
1783280689 (electronic bk.)
Patron reviews: add a review
Click for more information
EBOOK
No one has rated this material

You can...
Also...
- Find similar reads
- Add a review
- Sign-up for Newsletter
- Suggest a purchase
- Can't find what you want?
More Information