|
More site info...
Data Integration / ETL | Forum profile
|
|
Forum profile page for Data Integration / ETL on http://www.pentaho.org.
This report page is the aggregated overview from a single forum: Data Integration / ETL, located on the Message Board at http://www.pentaho.org.
This forum profile page summarizes the general forum statistics such as: Users Activity, Forum Activity, and Top Authors, which are reported in either a table or graph below for a given reporting time period.
Additional forum profile information for "Data Integration / ETL" on the Message Board at http://www.pentaho.org is also shown in the following ways:
1) Latest Active Threads
2) Hot Threads for Last Week
Warning: These statistics are generated using 'best efforts' and can experience delays and reporting errors at times. Please note that such statistics do not constitute a forum's popularity and/or exact posting volumes at any given reporting period.
|
|
|
|
|
Posting activity on Data Integration / ETL:
|
|
Week
|
Month
|
3 Months
|
|
Threads:
|
98
|
362
|
914
|
|
Post:
|
234
|
902
|
2,304
|
|
|
Data Integration / ETL Posting activity graph:
|
Top authors during last week:
user's latest post:
Data Validation Step - Clarity
Published (2009-11-13 16:27:00)
The Frog, That's what the second option is for (3.2.0GA) : Code: [ ] Output one row, concatenate errors with separator [_____] So the answer is that you get N rows for N validation errors per input row, unless you enable the option above.
user's latest post:
Reference the step "copy...
Published (2009-11-13 19:33:00)
Find a step that supports variables (the grey diamond with a red $ in it) and press CTRL-Space. That will show you a list of them. Most have very descriptive names. Other than that... Not that I know of. Feel free to write one and post it on the wiki
user's latest post:
Pre-compile or run once required...
Published (2009-11-13 16:32:00)
Quote: Originally Posted by gutlez Then I guess you're beyond where I can help you - without having access to your java program and the KTR files (and DBs that underly the java program), I can't try to diagnose any further. If you only got one row in the text File (excluding the header), then it *should* work. Sadly, should != is. hey np gutlez !! thanks for always trying to help, thanks for your time. Issue resolved. i'm a...
user's latest post:
Encrypting kettle.properties
Published (2009-11-13 03:59:00)
It has come up a couple of times (can we put it in the FAQ somehow?) - the basic problem is that you will need to have the key right next to the properties file itself. So if you need to protect the properties file your will have to equally protect the key. simple example? Imagine your live environment. There is a PDI installation, the application user 'app' has a .kettle/kettle.properties file with DB passwords and these things. Now...
user's latest post:
Mail transformation
Published (2009-11-12 18:22:00)
or split it into two transformations....one transformation for storing output text file and another for sending email. Put both of them in a job..
user's latest post:
Need suggestions to improve...
Published (2009-11-04 18:48:00)
Can some one give some tips/suggestions to improve performance. We have a transformation that supposed to load around 17000 rows from source(SQL Server) to target(Oracle). It is taking 90 minutes. by setting commit size to 10 K, it took 80 minutes. so changing the commit size is also not helping much. Tips/suggestions to improve performance would be appreciated.
user's latest post:
Read/write file from Javascript
Published (2009-11-13 18:01:00)
Hi, I need to write a pretty complicated code such as loops on the columns on a file, calculations etc', that i don't think Kettle supports. I'm thinking of implementing it by reading a text file from javascript and do all the processing inside the javascript. is there a way to open a file for read/write from javascript? thanks
user's latest post:
Data Validation Step - Clarity
Published (2009-11-13 11:46:00)
Hi Everyone, I am seeking clarity on an option in the Data Validation step. There is an option to 'report all errors, not only the first'. Seems pretty clear . My query is whether or not this is done on a per input row basis, ie/ each error for each row produces a new error (each validation), or do you get one error per row? I am just sketching out a design and wanted to know for sure. Any help appreciated. Cheers The Frog
user's latest post:
When XML input boms, you...
Published (2009-11-09 05:05:00)
regardless of this issue; I still dont feel i can use this in a production scenario because it'll break my whole transformation if there is malformed xml - there's no way i can handle it right? So i'll raise a bug of there's no work around?
user's latest post:
Insert New Rows in Data...
Published (2009-11-10 22:56:00)
What is the best transformation step to build a datawarehouse table with. Specifically, I need to add data each day to a datawarehouse table. Should I use Insert/Update? or Table Output? or............something else? and how do I make sure I am getting all the records since the last entry in the table? Thanks - Kinda Rookie, but you are going to save me a lot of time........
|
|
|
|
Latest active threads on Data Integration / ETL::
Started 1 day, 17 hours ago (2009-11-13 14:19:00)
by gutlez
Do you really have a variable {dbname} ?
Try changing it to ${dbname}
Set the variable "dbname" (for testing, no quotes...) in the "Edit" -> "Set Environment Variables" Menu Item.
Give it a valid ODBC name ...
Then click your test.
Started 1 day, 9 hours ago (2009-11-13 22:30:00)
by pstoellberger
well
ad 1) can't think of anything. thats clearly a connection error
ad 2) i don't know how you process the files.. but just change the evaluation to "unconditional" after that transformation
Started 1 day, 13 hours ago (2009-11-13 18:18:00)
by gutlez
Does the variable ${Internal.Step.CopyNr} work for that?
It's a 0-based variable (ie. 5 steps would be numbered 0,1,2,3,4)
Started 4 months, 4 weeks ago (2009-06-19 14:06:00)
by osbock osbock is offline Junior Member
I'm having trouble getting what I need out of the Regex component.
My regex is currently in use in a java program and works fine in regexbuddy, and other online regex verifiers. Is there some sort of quoting or something that I'm missing?
Here's the regex, designed to extract a search query from a stored cookie that contains a request string:
(?:&|\\?)q=(.{1,}?)(?:&|$)
and here's...
Started 1 week, 1 day ago (2009-11-06 10:32:00)
by codek codek is offline Senior Member
Hi,
I'm trying to use XML input to read data from an xml field.
I get the error below. I added error handling to re-direct the data to a different file so i could debug it, and it doesnt work, the transform still exits unexpectedly.
Any ideas? Shouldnt error handling deal with this? (3.2.2)
Thanks,
Dan
2009/11/06 14:30:43 - Process Request XML.0 - ERROR (version 3.2.2-...
Started 3 days, 1 hour ago (2009-11-12 06:32:00)
by MattCasters
If you need variables to be set, you need to set them. That's all that's required.
Started 1 day, 15 hours ago (2009-11-13 16:27:00)
by MattCasters
The Frog,
That's what the second option is for (3.2.0GA) :
Code:
[ ] Output one row, concatenate errors with separator [_____]
So the answer is that you get N rows for N validation errors per input row, unless you enable the option above.
Started 2 days, 14 hours ago (2009-11-12 17:57:00)
by gutlez
Which step are you trying to use?
|
|
Hot threads for last week on Data Integration / ETL::
Started 3 days, 1 hour ago (2009-11-12 06:32:00)
by MattCasters
If you need variables to be set, you need to set them. That's all that's required.
Started 1 week, 1 day ago (2009-11-06 10:32:00)
by codek codek is offline Senior Member
Hi,
I'm trying to use XML input to read data from an xml field.
I get the error below. I added error handling to re-direct the data to a different file so i could debug it, and it doesnt work, the transform still exits unexpectedly.
Any ideas? Shouldnt error handling deal with this? (3.2.2)
Thanks,
Dan
2009/11/06 14:30:43 - Process Request XML.0 - ERROR (version 3.2.2-...
Started 1 week, 1 day ago (2009-11-07 02:46:00)
by DEinspanjer
I'd suggest looking at the following steps:
1. Get sorted data from table
2. Get sorted new data
3. Join #1 and #2 with a Merge Diff Step
4. Filter the Joined data for only changed records
5. Calculator step using NVL to get the non- null value for every field (if it exists)
6. Update step inserting your NVLed fields
Started 5 days, 7 hours ago (2009-11-10 00:22:00)
by roco roco is offline Senior Member
Just looking to apply a big filter to a transformed set of rows using 200 different filter strings(OR conditions). Seems a bit cumbersome to add 200 "OR" conditions in the Filter Step so I was hoping that the Step would accept comma separated values or something like that? Is there a better way? Thank you.
Oh also, how do I also set the filter to be Case Insensitive?
Started 2 days, 14 hours ago (2009-11-12 17:57:00)
by gutlez
Which step are you trying to use?
Started 4 days, 16 hours ago (2009-11-10 15:17:00)
by rhaces
I, I have a JDBC issue when doing an SQL with the "Execute SQL script" step. I do an insert of millions of rows and the problem is that the JDBC cant grab the number of rows inserted, so I get the following Error: "The update count exceeded Integer.MAX_VALUE.", but the sql operation is done correctly. Talking to the jdbc developers they told me that the JDBC ...
Started 5 days, 9 hours ago (2009-11-09 22:40:00)
by sayap
I supposed this is caused by the following line in LogWriter.java?
Code:
pentahoLogger.setLevel(Level.ALL);
Why do we want to do this inside the code, instead of just setting it so in the default log4j.xml?
Started 1 week, 1 day ago (2009-11-06 09:12:00)
by retiefdv retiefdv is offline Junior Member
I have noticed that the Spoon splash screen displays as a black square on Ubuntu 9.10. See attached screenshot image. I know this is just a cosmetic issue, but I thought you might be interested to know about the problem.
Started 3 days, 1 hour ago (2009-11-12 06:31:00)
by MattCasters
Hi Ferhat,
I'm not quite sure what problem this is supposed to solve.
What exactly is your problem? These variables can already be set in kettle.properties. So perhaps it's something else?
Feel free to share the experience. As Dr Indiana Jones once said: "Nothing shocks me, I'm a scientist" ;-)
Matt
Started 1 day, 13 hours ago (2009-11-13 18:18:00)
by gutlez
Does the variable ${Internal.Step.CopyNr} work for that?
It's a 0-based variable (ie. 5 steps would be numbered 0,1,2,3,4)
|
|