I have a high level requirement like I need to identify an ETL tool for loading into a new Data warehouse.
So need to do dimensional modelling as well.
Anyways modelling is independent of selecting ETL tool.
a) Input data from source is having 3.5 million rows per day or 7GB data coming up and projected increase after 3 yrs would be 18 million or 40 GB approx.
Trying to understand which should be the best ETL tool to choose, assuming that it has medium complexity transforms and source is an OLTP layer.
Kindly assist me in choosing the Tool as I am shortlisting INFORMATICA where it has the capacity to do parallel processing.