Posts

Showing posts from 2012

Advantages / Uses of ETL

- Data-warehouse population - ETL tool can be used to populate the dimensions & fact tables of a data-warehouse - Data migration - ETL tool can be used to migrate or transfer the data between variety of sources like databases, CRMs, cloud, files, emails etc. - Transferring huge data - ETL tools are used to benefit in the transfer rate as they benefit from different technologies like parallel processing, selective processing, etc. - Data cleansing- ETL tool can be used to cleanse your data i.e. remove any inconsistent or ambiguous data   - Source for reports & analysis - ETL readies the data for further representation in the data warehouse in the form of reports or analysis. - Big data support - ETL tool supports many big data options such as Hadoop, etc.

Master Data Management In PDI

MDM comprises a set of processes, governance, policies, standards and tools that consistently defines and manages the master data (i.e. non-transactional data entities) of an organization (which may include reference data) PROCESS STEPS IN PENTAHO DATA INTEGRATION Profiling DataCleaner plug-in Data Quality Fuzzy Match,  Validation, Scripting steps,  Calculator (different functions), Filter, Error Handling

Increase java memory allocation for Talend Integration Suite

Edit the .ini file (Configuration setting) (TISPE-win32-x86_64.ini for 64 bit version windows, TISPE-win32-x86.ini for 32 bit version windows version) (Suppose you have 8 gb ram you can allocate 4 gb to tis) -vmargs -Xms500m -Xmx4096m -XX:MaxPermSize=512m