Pentaho Data Integration vs Dataloader.io vs Skyvia

Pentaho Data Integration, Dataloader.io, and Skyvia all offer data integration solutions. Compare the features and benefits, data sources and destinations, and see which meets your needs. Look at the side-by-side comparison chart of the three data integration solutions.

About the Services

Pentaho Data Integration

Pentaho Data Integration (PDI) is an ETL tool with different offerings. You can choose between the open-source Community Edition and the Enterprise Edition. Pentaho launched in 2004 and is now part of Hitachi Vantara. Since its launch, the tool has served a growing number of customers worldwide, and the feedback is positive.

PDI enables you to extract, transform, and load (ETL) data from a variety of sources. With PDI, you can integrate data to create meaningful insights and reports. These reports can help in business decisions.

PDI’s user interface is intuitive and easy to navigate. However, according to some reviews on Gartner Peer Insights, it can get slow at times, and customization of the UI is limited. Only 68% of those reviewers would recommend it.

PDI is a product you need to install and configure. That means you take care of security and privacy concerns on your own. Yet, you can deploy PDI in the cloud and rely on whatever certifications the cloud provider has. PDI also supports security providers and protocols like LDAP, Secure Sockets Layer (SSL), and more. So, PDI is not lax on security and privacy.

Dataloader.io

Dataloader.io is a cloud solution for importing, exporting, and deleting Salesforce data. Powered by the MuleSoft Anypoint Platform, it is the cloud counterpart of the desktop tool of the same name. It focuses on Salesforce data, enabling imports and exports to and from CSV files. These CSV files may come from FTP, SFTP, or cloud storage like Box and Dropbox.

Dataloader.io doesn't require registration as it supports Salesforce authentication. Much like the desktop version, it uses fill-in-the-blank forms and not drag-and-drop. You can't use any other source or target aside from the mentioned ones. You can choose a task between Import, Export, and Delete and follow the wizard to complete it. The difference between the cloud and the desktop version is in scheduling and notification features.

With Dataloader.io, your data is safe with security and privacy certifications. It is PCI Level 2, SSAE16, ISO 27001, SOC 1 and 2, and GDPR compliant. This cloud service lives in Amazon Web Services (AWS) and uses AWS privacy and security certifications.

Skyvia

Skyvia is a no-code cloud data integration platform for many data integration scenarios. It’s an all-rounder tool for ETL, ELT, Reverse ETL, data migration, one-way and bi-directional data sync, workflow automation, and more. Devart launched this fantastic product in 2014 for cloud data integration and backup.

Skyvia offers more than 180 ready-made data connectors. These are available to thousands of free users and more than 2,000 paying customers. Big names like Hyundai and General Electric trust Skyvia to process their data. Its easy-to-use, drag-and-drop interface suits both IT professionals and business users. And don’t take our word for it. Listen to what G2 reviewers say about how easy it is to start and work with it. Data integration experts who have used other tools can adapt with little to no help from support.

Skyvia has flexible pricing plans that suit both small startups and large enterprises, so it fits businesses of all sizes. Also, Skyvia’s freemium model lets users start right away and decide later whether they need to upgrade.

The safety of your data is also our prime concern. So, we host Skyvia in the Microsoft Azure cloud, which provides strong data security and privacy. It complies with a wide set of security standards, including SOC 2, ISO 27001, and many others.

| | Pentaho Data Integration | Dataloader.io | Skyvia |
|---|---|---|---|
| Focus | ETL, streaming data | ETL | Data ingestion, ELT, ETL, reverse ETL, data sync, workflow automation. |
| Skill level | Low-code, no-code solution. | No-code solution. | No-code wizard. Top-rated as one of the easiest ETL tools by G2. |
| Sources | 40+ | CSV, Salesforce. | 180+ |
| Destinations | Supported data sources. | CSV, Salesforce. | Supported data sources, including databases, data warehouses, cloud apps and flat files. |
| Database replication | Full or incremental load. | Full or partial based on date filters. | Full table and incremental via change data capture. |
| Ability for customers to add new data sources | Through JDBC or ODBC. | None | Yes, by request or using REST API connector. |
| G2 customer satisfaction | 4.3 out of 5 (15 reviews) | None | 4.8 out of 5 (217 reviews) |
| Peer Insights satisfaction | 4.1 (137 ratings) | 4.2 out of 5 (8 reviews) | 4.8 (103 ratings) |
| Developer tools | Pentaho Data Integration. Pentaho Metadata Editor. Pentaho Aggregation Designer. Pentaho Schema Workbench. | Web-based interface. | REST connector for data sources that have REST API. |
| Advanced ETL capabilities | Supports Hadoop and Spark integrations for Big Data. | None | Visual ETL data pipeline designer with data orchestration capabilities. |
| Compliance and security certifications | No provided security and privacy certifications. User and Role management within PDI. Supports security providers such as LDAP, MSAD, SSO, SSL, AES, JDBC. | PCI Level 1, SSAE16, AES-256 data encryption, ISO 27001, SOC 1 and 2, GDPR. | HIPAA, GDPR, PCI DSS. ISO 27001 and SOC 2 (by Azure). |
| Purchase process | Use the free trial and talk to sales. | Use the Free Tier or start the 30-day free trial then contact Sales. | Self-service or sales. |
| Vendor lock-in | Pentaho Community Edition is open-source. Talk to sales for more details. | Monthly | Monthly or annual contracts. |
| Pricing | Community Edition is free and open source. Contact sales for Enterprise Edition. 30-day free trial of Enterprise Edition. | 3 Pricing Plans (Free, Professional, Enterprise). Row-based pricing. With a 30-day free trial for Enterprise. | Volume-based and feature-based pricing. Freemium model allows to start with a free plan. |

Connectors

Pentaho Data Integration

Pentaho Data Integration offers a wide range of data connectors through JDBC. So, you can connect to databases, cloud platforms, and big data sources like Hadoop. Some of the most popular ones are MySQL, SQL Server, Google Analytics, and Salesforce.

PDI allows you to create your own custom data connector or use pre-built connectors created by the Pentaho community. In addition, PDI offers a marketplace where you can find third-party connectors. So, you can try your luck there if you don’t want to roll up your sleeves and build a connector.

Dataloader.io

There's little to say about Dataloader.io connectors because you can only use Salesforce data and CSV files. Your CSVs should be on your local drive, an FTP server, or cloud storage like Box and Dropbox. For Enterprise users, SFTP is also available.

Dataloader.io allows importing attachments to Salesforce from Zip files and exporting attachments from Salesforce to a Zip file. Each attachment relates to a Salesforce object, like Contacts or Events. When importing attachments, you should have a CSV file containing information about the attachments and the Zip file.

Trying an Import task shows a Database option as the data source, but it's not yet available.

If you need more data sources and destinations, it is better to choose another tool that has more connectors.

Skyvia

Skyvia offers more than 180 connectors, and more to come very soon. It supports connectors for CRMs, accounting, email marketing, e-commerce, human resources, marketing automation, payment processing, product management, all major databases and DWH, flat files, and more. It’s also not a problem whether your data is on-premise or in the cloud.

You can access your on-premise data with peace of mind using the Skyvia Agent. It allows you to connect to databases like SQL Server, MySQL, and more using an encrypted connection. You need to download the Skyvia Agent and install it. Then, download a secured key file and place it in the same folder as the Agent. The Agent is like an unbreakable metal door, and you use the key file to open that door to your on-premise data. You can also set it up so that Skyvia can access only the resources you specify and nothing else.

Customers can also leave a request for a new data connector. And Skyvia will prioritize building it without additional payment.

Transformation

Pentaho Data Integration

Pentaho Data Integration (PDI) allows you to transform data from one format to another. With PDI, customers can perform a variety of data transformations. Some of these are simple operations like filtering and sorting. There are also complex operations like pivoting, joining, and data validation.

PDI lets you drag and drop transformations into place. No need to write any code. But if you need custom transformations, you can code in languages like Java.

Dataloader.io

In Dataloader.io, you only specify a source Salesforce object or the CSV file. Then, you map the columns. There’s no option to transform data further.

If you need further transformation, you can use an external tool and work on the CSV files.
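As a rough illustration of that workaround, the sketch below cleans a CSV with Python's standard `csv` module before it is uploaded and mapped. The input columns (`full_name`, `email`) and the output column names are hypothetical choices for this example, not something Dataloader.io prescribes.

```python
import csv
import io


def clean_contacts_csv(raw_csv: str) -> str:
    """Return a cleaned copy of a CSV: names split, emails normalized, duplicates dropped.

    The column names used here are illustrative assumptions only.
    """
    reader = csv.DictReader(io.StringIO(raw_csv))
    out = io.StringIO()
    writer = csv.DictWriter(
        out, fieldnames=["FirstName", "LastName", "Email"], lineterminator="\n"
    )
    writer.writeheader()

    seen_emails = set()
    for row in reader:
        email = row["email"].strip().lower()
        # Skip rows without an email and rows whose email was already written.
        if not email or email in seen_emails:
            continue
        seen_emails.add(email)
        # Split "First Last" on the first space; the remainder becomes the last name.
        first, _, last = row["full_name"].strip().partition(" ")
        writer.writerow({"FirstName": first, "LastName": last.strip(), "Email": email})
    return out.getvalue()


raw = (
    "full_name,email\n"
    "Ada Lovelace,ADA@example.com\n"
    "Ada Lovelace,ada@example.com \n"
    "Grace Hopper,grace@example.com\n"
    "No Email,\n"
)
cleaned = clean_contacts_csv(raw)
```

The cleaned text can then be saved as a file and used as the source of an Import task, where only the column mapping remains to be done.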

Skyvia

Skyvia is a full-featured ETL service that allows powerful data transformations. It is a no-code solution allowing data splitting, conversion, lookups, and many more.

You can use the Skyvia Data Flow and Control Flow for advanced data pipelines. Transformations for these advanced pipelines are flexible. They support extending your data with new columns, conditional flows, and summarized values. You can do all of this without code, using parameters, variables, and more.

Moreover, Skyvia has an Expression Builder to build formulas with many functions. With this, you can convert or extract parts of the data or form new values to suit your needs. And if you love coding in SQL, Skyvia can further extend your transformation needs. It supports multiple joins, groupings, CASE expressions, and more in SELECT queries. And you can also use DML commands like INSERT, UPDATE, and DELETE.
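To make the kind of query described above concrete, here is a minimal sketch of a SELECT that combines a join, a grouping, and a CASE expression. It runs against an in-memory SQLite database through Python's `sqlite3` module purely as a stand-in; the tables, columns, and threshold are invented for the example, and the exact SQL dialect that Skyvia accepts may differ.

```python
import sqlite3


def summarize_accounts():
    """Run a JOIN + GROUP BY + CASE query on hypothetical sample data."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE accounts (id INTEGER PRIMARY KEY, name TEXT);
        CREATE TABLE orders (id INTEGER PRIMARY KEY, account_id INTEGER, amount REAL);
        INSERT INTO accounts VALUES (1, 'Acme'), (2, 'Globex');
        INSERT INTO orders VALUES (1, 1, 1200.0), (2, 1, 300.0), (3, 2, 150.0);
        """
    )
    query = """
        SELECT a.name,
               SUM(o.amount) AS total,
               CASE WHEN SUM(o.amount) >= 1000 THEN 'key' ELSE 'standard' END AS tier
        FROM accounts AS a
        JOIN orders AS o ON o.account_id = a.id
        GROUP BY a.name
        ORDER BY a.name
    """
    rows = conn.execute(query).fetchall()
    conn.close()
    return rows


rows = summarize_accounts()
# rows == [('Acme', 1500.0, 'key'), ('Globex', 150.0, 'standard')]
```

The CASE expression derives a new column from an aggregated value, which is the sort of reshaping that otherwise needs a separate transformation step.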

Support

Pentaho Data Integration

Pentaho Data Integration (PDI) offers various levels of support. You can access support through the Hitachi Vantara support portal. There you’ll find knowledge base articles, product documentation, and community forums.

PDI customers have access to standard support. This includes email support and access to the support portal during business hours. Premium support is also available. It provides 24/7 phone support, priority handling, and faster response times. Service Level Agreements (SLAs) are available for premium support.

You can also access a range of professional services like consulting and training.

Dataloader.io

You have two support options in Dataloader.io: email and community. On the Free plan, only the community can support you. The paid plans add email support.

But for all paid plans, there’s a help center with videos and instructions for Dataloader.io tasks. Note that there’s no SLA or priority support available.

Skyvia

Skyvia offers free email, chat (on the website or in-app), and forum support for all customers. It also provides extensive documentation with lots of tutorials and user guides.

For paid customers, there's also a phone support option and additional support options for Enterprise customers.

Pricing

Pentaho Data Integration

Pentaho Data Integration (PDI) offers a free Community Edition. It is open source but comes with limited features. If you’re looking for more advanced features and support, try the Enterprise Edition. However, the website doesn’t list pricing for the Enterprise Edition.

PDI does offer a 30-day trial of the Enterprise Edition. It allows you to try out the full range of features and see if this product is for you. If you decide to buy it, you’ll be able to access premium support services as well.

Dataloader.io

There are three pricing plans for Dataloader.io: Free, Professional, and Enterprise. Each plan has its limits on what you can do within Dataloader.io.

You can try the Free plan immediately if you have a Salesforce account. The Free plan allows you to process 10,000 rows per month and a CSV file limit of 10MB. CSV locations available to you are your local drive, FTP, Box, and Dropbox. You can only have one scheduled task, which expires after 30 days. If you need help, only the community can provide it.

With the Professional plan, the number of rows increases to 100,000 per month, and the CSV file limit extends to 50MB. You can also have 50 scheduled tasks that never expire. If you need help, you can send an email or ask the community. There’s no free trial for the Professional plan. Try the Free plan first. Then, extend the limits by paying per month per user.

The Enterprise plan offers a 30-day free trial. You can process unlimited rows monthly, and your file limit increases to 100MB. You also have flexible security and unlimited scheduled tasks.
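The limits above can be turned into a quick check for which plan a given workload needs. The sketch below encodes only the row, file-size, and scheduled-task limits quoted in this section; the class and function names are made up for illustration, and other differences between plans (such as the 30-day expiry of the Free plan's scheduled task, or SFTP availability) are not modeled.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Plan:
    name: str
    max_rows_per_month: Optional[int]   # None means unlimited
    max_file_mb: int
    max_scheduled_tasks: Optional[int]  # None means unlimited


# Limits as quoted in the text above, ordered from smallest to largest plan.
PLANS = [
    Plan("Free", 10_000, 10, 1),
    Plan("Professional", 100_000, 50, 50),
    Plan("Enterprise", None, 100, None),
]


def smallest_plan(rows_per_month: int, file_mb: float, scheduled_tasks: int) -> Optional[str]:
    """Return the first plan whose limits cover the workload, or None if none does."""
    for plan in PLANS:
        rows_ok = plan.max_rows_per_month is None or rows_per_month <= plan.max_rows_per_month
        file_ok = file_mb <= plan.max_file_mb
        tasks_ok = plan.max_scheduled_tasks is None or scheduled_tasks <= plan.max_scheduled_tasks
        if rows_ok and file_ok and tasks_ok:
            return plan.name
    return None


print(smallest_plan(8_000, 5, 1))     # Free
print(smallest_plan(8_000, 5, 2))     # Professional (more than one scheduled task)
print(smallest_plan(250_000, 20, 10)) # Enterprise
print(smallest_plan(1_000, 150, 1))   # None: the file exceeds every plan's limit
```

A workload that returns `None` here, such as one with files over 100MB, would need splitting into smaller files or a different tool.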

Check out the pricing page for Dataloader.io for more details.

Skyvia

Skyvia Data Integration is a freemium tool with an option to request a 14-day trial. So, price is not a barrier to entry.

And when you’re ready, paid plans start from $19 per month. Pricing tiers depend on a few factors, including the number of loaded records, scheduling frequency, and advanced ETL features. There are no sales commitments, and customers can upgrade or downgrade at any time. Check out a detailed comparison here.

If you doubt the price is worth it, check out review sites like G2. Aside from ease of use, reasonable pricing is one of the things Skyvia customers like. So, you can be sure the features you get are worth every penny.