Questions tagged [pentaho]

The Pentaho BI Suite is an open source business intelligence suite with integrated reporting, dashboard, data mining, workflow, and ETL capabilities. It is primarily written in Java.

The latest GA release of Pentaho Suite has four primary downloadable products.

PDI - Pentaho Data Integration is a graphical extract, transform, and load (ETL) solution that uses an innovative metadata-driven approach. PDI includes the DI Server, a design tool (aka Spoon), three utilities, and several plugins.

PBA - Pentaho Business Analytics formally known as Pentaho Business Intelligence (BI) is mainly used for dashboard development.

PRD - Pentaho Report Designer is used for developing business (B2B and B2C) reports.

PSW - Pentaho Schema workbench is used for developing OLAP cubes and schemas.

3605 questions
70
votes
9 answers

Pentaho Data Integration SQL connection

I am using Pentaho Data Integration and I am trying to connect to my database via MySQL but when I do I get this error..... Error connecting to database [devdb2] : org.pentaho.di.core.exception.KettleDatabaseException: Error occured while trying to…
user979331
  • 10,209
  • 56
  • 186
  • 339
38
votes
9 answers

How to get last 7 days data from current datetime to last 7 days in sql server

Hi I am loading table A data from sql server to mysql using pentaho when loading data i need to get only last 7 days data from sql server A table to mysql In sql server createddate column data type is like datetime AND In mysql created_on column…
SRI
  • 411
  • 1
  • 4
  • 11
37
votes
9 answers

Pentaho vs Microsoft BI Stack

My company is heavily invested in the MS BI Stack (SQL Server Reporting Services, -Analysis Services and -Integration Services), but I want to have a look at what the seemingly most talked about open-source alternative Pentaho is like. I've…
Tomas
  • 3,204
  • 2
  • 22
  • 28
26
votes
5 answers

Books to start learning big data

I would like to start learning about the big data technologies. I want to work in this area in the future. Does anyone know good books to start learning about it? Hadoop, HBase. Beginner - intermediate - advanced - Thanks in advance
Gunter Amorim
  • 49
  • 1
  • 5
  • 13
21
votes
1 answer

Using Pentaho Kettle, how do I load multiple tables from a single table while keeping referential integrity?

Need to load data from a single file with a 100,000+ records into multiple tables on MySQL maintaining the relationships defined in the file/tables; meaning the relationships already match. The solution should work on the latest version of MySQL,…
blunders
  • 3,449
  • 10
  • 39
  • 64
19
votes
2 answers

Using Pentaho Kettle, how do I automatically retry rest requests which fail due to connection hiccups?

How can we make Pentaho retry rest requests on connection errors? We have a Pentaho BI system which, among numerous data sources, is querying a particular REST api for over 20k query variations each run. Predicatably, on most runs a few of these…
ms-tg
  • 2,558
  • 20
  • 18
16
votes
5 answers

If Pentaho is Open Source, can I just use it?

I see that Pentaho wants to charge me for their software. How can I get to the underlying Open Source software for dashboards to see what it can do without having to deal with Pentaho marketing folks?
Suds
14
votes
2 answers

Pass DB Connection parameters to a Kettle a.k.a PDI table Input step dynamically from Excel

I have a requirement such that whenever i run my Kettle job, the database connection parameters must be taken dynamically from an excel source on each run. Say i have an excel with column names : HostName, Username, Database, Password. i want to…
Ritesh
  • 227
  • 1
  • 4
  • 13
13
votes
3 answers

Pentaho kettle: how to set up tests for transformations/jobs?

I've been using Pentaho Kettle for quite a while and previously the transformations and jobs i've made (using spoon) have been quite simple load from db, rename etc, input to stuff to another db. But now i've been doing transformations that do a bit…
hannesh
  • 492
  • 8
  • 15
13
votes
3 answers

Kettle / Pentaho Data Integration - unable to create a Database Connection (XulException: java.lang.reflect.InvocationTargetException)

Having finally got kettle to start and not hang, I still cannot use it to much avail, as when I try to create a new Database Connection (after creating a new Transformation) I get this error: org.pentaho.ui.xul.XulException:…
Blew my stack
  • 221
  • 4
  • 8
13
votes
1 answer

Error while exporting clob field from oracle to text field on Azure DB

I have a CLOB field on an Oracle Table that stores a PDF file. When I try export this field to SQL Server Db on Azure I got this error: 2017/01/19 11:14:32 - ImpostoRenda 2.0 - ERROR (version 6.1.0.1-196, build 1 from 2016-04-07 12.08.49 by…
Renatto Machado
  • 1,474
  • 1
  • 15
  • 29
13
votes
8 answers

Unable to connect to oracle via Pentaho. 'oracle.jdbc.driver.OracleDriver' could not be found

Hi I've just installed Pentaho Data Integration v. 5.01 and I'm trying to connect to a database. However, I'm unable to test the connection as I get the error (pasted below). I have JDK 1.7 installed on the system as well, however I can't seem to…
Joshua1729
  • 715
  • 3
  • 9
  • 18
12
votes
2 answers

Fast alternative to split in R

I'm partitioning a data frame with split() in order to use parLapply() to call a function on each partition in parallel. The data frame has 1.3 million rows and 20 cols. I'm splitting/partitioning by two columns, both character type. Looks like…
argoneus
  • 651
  • 7
  • 14
11
votes
5 answers

Maven Dependency for PDI(Pentaho Kettle) Jar files

I have written Java code to execute my transformation and Jobs and I have manually added all the Jar files present in the data-integration/lib folder to my class path and evrything is working fine. Now I want to mavenize my project and looking for…
sun_dare
  • 1,111
  • 2
  • 10
  • 32
11
votes
7 answers

Connect Pentaho to MS SQL Server (Native)

I am very new in using PENTAHO. I need to connect this software to MS SQL Server (Native). I already try to connect it many time but every time I test the connection, failed. This is the error message that occur every time I test the…
crystal
  • 225
  • 2
  • 3
  • 13
1
2 3
99 100