Category Articles
What is the difference between a sequential file and a dataset? When should the Copy stage be used?
Added on Thu, Dec 3, 2009
The Sequential File stage stores a small amount of data in a file with any extension, whereas a Data Set is used to store a huge amount of data and can be opened only with the .ds extension. The Copy stage copies a single input data set to a...
How can you implement complex jobs in DataStage?
Added on Thu, Dec 3, 2009
A complex design is one with many joins and lookups; a job designed that way is called a complex job. We can implement any complex design in DataStage by following a few simple tips, which also help performance. There is no...
What is the difference between DataStage and Informatica?
Added on Thu, Dec 3, 2009
Here is a very good article on these differences, which helps to get an idea. Basically, it depends on what you are trying to accomplish. What are the requirements for your ETL tool? Do you have large sequential files (1 million rows, for...
How can we call a routine in a DataStage job? Explain with steps.
Added on Thu, Dec 3, 2009
Routines are used to implement business logic. There are two types: 1) before subroutines and 2) after subroutines. Steps: double-click the Transformer stage, right-click any one of the mapping fields, and select the [DS Routines] option within edit...
How to implement Type 2 slowly changing dimensions in DataStage? Explain with an example.
Added on Thu, Dec 3, 2009
We can handle SCDs in the following ways. Type 1: just use "Insert rows else update rows" or "Update rows else insert rows" in the update action of the target. Type 2: use the steps as follows...
What are the transaction size and array size in the OCI stage? How can they be used?
Added on Thu, Dec 3, 2009
Transaction Size - This field exists for backward compatibility, but it is ignored for release 3.0 and later of the Plug-in. The transaction size for new jobs is now handled by Rows per transaction on the Transaction Handling tab on the Input page. ...
How can you do an incremental load in DataStage?
Added on Thu, Dec 3, 2009
Incremental load means daily load. Whenever you select data from the source, select the records that were loaded or updated between the timestamp of the last successful load and today's load start date and time. For this you have to pass parameters for...
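The selection rule described in this answer can be sketched in Python. This is only an illustration, not DataStage code: the row layout, the `updated_at` field name, the sample timestamps, and the choice of an exclusive lower bound with an inclusive upper bound are all assumptions made for the example.

```python
from datetime import datetime


def select_incremental(rows, last_successful_load, load_start):
    """Return rows changed after the last successful load, up to this load's start."""
    return [
        row for row in rows
        if last_successful_load < row["updated_at"] <= load_start
    ]


# Hypothetical source rows; "updated_at" is an assumed column name.
rows = [
    {"id": 1, "updated_at": datetime(2009, 12, 1, 8, 0)},
    {"id": 2, "updated_at": datetime(2009, 12, 2, 23, 30)},
    {"id": 3, "updated_at": datetime(2009, 12, 3, 9, 15)},
]
last_successful_load = datetime(2009, 12, 2, 1, 0)
load_start = datetime(2009, 12, 3, 1, 0)

changed = select_incremental(rows, last_successful_load, load_start)
print([row["id"] for row in changed])
```

In a real job the two timestamps would typically arrive as job parameters, as the answer suggests, and the comparison would run in the source query rather than in memory.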
How can we improve the performance of DataStage jobs?
Added on Thu, Dec 3, 2009
Performance and tuning of DS jobs: 1. Establish baselines 2. Avoid the use of only one flow for tuning/performance testing 3. Work in increments 4. Evaluate data skew 5. Isolate and solve 6. Distribute file...
How can we write parallel routines in DataStage PX? Can anyone help with this quickly? An example program would be very helpful.
Added on Thu, Dec 3, 2009
First, understand what routines are: routines are sets of functions defined in DS Manager and called through the Transformer stage. Go to DS Manager, select Routines on the left side of the window, and click on Routines; then a pop-up window...
What is a hashing algorithm? Explain briefly how it works.
Added on Thu, Dec 3, 2009
Hashing is key-to-address translation. This means the value of a key is transformed into a disk address by means of an algorithm, usually a relative block and anchor point within the block. It’s closely related to statistical probability as to...
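A minimal Python sketch of key-to-address translation follows. It is not the algorithm DataStage uses: CRC32 stands in for the hash function, the "address" is just a group number obtained by taking the hash modulo an assumed group count, and all function names are hypothetical.

```python
import zlib


def group_for_key(key, modulus):
    """Translate a key into a group number (the 'address') in the range 0..modulus-1."""
    return zlib.crc32(key.encode("utf-8")) % modulus


def build_hashed_file(records, modulus):
    """Place each (key, data) pair into the group its key hashes to."""
    groups = [dict() for _ in range(modulus)]
    for key, data in records:
        groups[group_for_key(key, modulus)][key] = data
    return groups


def lookup(groups, key):
    """Recompute the address from the key and search only that group."""
    return groups[group_for_key(key, len(groups))].get(key)


groups = build_hashed_file([("CUST001", "Lisa"), ("CUST002", "Ravi")], modulus=7)
print(lookup(groups, "CUST001"))
print(lookup(groups, "CUST999"))
```

The point of the sketch is that a lookup never scans the whole file: the key alone determines which group to read.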
If a DataStage job aborts after, say, 1000 records, how can the job be continued from the 1000th record after fixing the error?
Added on Thu, Dec 3, 2009
By specifying checkpointing in the job sequence properties. If we then restart the job, it will start by skipping up to the failed record. This option is available in the 7.5 edition.
How do I create a DataStage engine stop/start script?
Added on Thu, Dec 3, 2009
Question: How do I create a DataStage engine stop/start script? Actually my idea is as below. #!/bin/bash dsadm - user su - root password (encrypt) DSHOMEBIN=/Ascential/DataStage/home/dsadm/Ascential/DataStage/DSEngine/bin if check ps -ef...
What happens if RCP is disabled?
Added on Thu, Dec 3, 2009
Runtime column propagation (RCP): If RCP is enabled for any job, and specifically for those stages whose output connects to the shared container input, then metadata will be propagated at run time, so there is no need to map it at design time. If...
How do we use the NLS function in DataStage? What are the advantages of the NLS function? Where can we use it? Explain briefly.
Added on Thu, Dec 3, 2009
By using the NLS function we can do the following: - Process data in a wide range of languages - Use local formats for dates, times and money - Sort data according to local rules. If NLS is installed,...
What are the environment variables in DataStage? Give some examples.
Added on Thu, Dec 3, 2009
These are the variables used at the project or job level. We can use them to configure the job, i.e. we can associate the configuration file (without this you cannot run your job) or increase the sequential or dataset read/write buffer. Ex: ...
How to improve the performance of a hashed file?
Added on Thu, Dec 3, 2009
You can improve the performance of a hashed file by: 1. Preloading the hashed file into memory --> this can be done by enabling the preloading options in the Hashed File output stage. 2. Write caching options --> this makes data be written into the cache before being...
What is the exact difference between the Join, Merge and Lookup stages?
Added on Thu, Dec 3, 2009
The three stages differ mainly in the memory they use. DataStage doesn’t know how large your data is, so it cannot make an informed choice whether to combine data using a Join stage or a...
What’s the difference between DataStage developers and DataStage designers? What skills are required for each?
Added on Thu, Dec 3, 2009
A DataStage developer is the one who codes the jobs. A DataStage designer is the one who designs the jobs; that is, he deals with the blueprints and decides which stages are required in developing the code.
What is the difference between the Merge stage and the Join stage?
Added on Thu, Dec 3, 2009
Merge and Join stage differences: 1. Merge has reject links. 2. Merge can take multiple update links. 3. If you use it for comparison, then the first matching data will be the output, because it uses the update links to extend the primary details...
What is APT_CONFIG in DataStage?
Added on Thu, Dec 3, 2009
APT_CONFIG is just an environment variable used to identify the *.apt file. Don't confuse it with the *.apt file itself, which holds the node information and the configuration of the SMP/MPP server.
What is the difference between the Lookup File stage and the Lookup stage?
Added on Thu, Dec 3, 2009
The Lookup stage is used to perform a lookup on a reference data set against source data, whereas the Lookup File stage is used to create the reference data set that the Lookup stage uses to perform the lookup operation with the source data.
Will DataStage consider the second constraint in the Transformer once the first condition is satisfied (if link ordering is given)?
Added on Thu, Dec 3, 2009
Answer: Yes.
How to implement routines in DataStage? If anyone has any material for DataStage, please send it to me.
Added on Thu, Dec 3, 2009
There are 3 kinds of routines in DataStage. 1. Server routines, which are used in server jobs; these routines are written in the BASIC language. 2. Parallel routines, which are used in parallel jobs; these routines are written in C/C++...
What are all the third party tools used in DataStage?
Added on Thu, Dec 3, 2009
Autosys, TNG and Event Coordinator are some of those that I know and have worked with.
How do you remove duplicates without using remove duplicate stage?
Added on Thu, Dec 3, 2009
In the target, make the column the key column and run the job.
What is the difference between symmetric multiprocessing and massively parallel processing?
Added on Thu, Dec 3, 2009
Symmetric Multiprocessing (SMP) - Some hardware resources may be shared by processors. Processors communicate via shared memory and have a single operating system. Cluster or Massively Parallel Processing (MPP) - Known as shared nothing, in which each...
How to find errors in a job sequence?
Added on Thu, Dec 3, 2009
Using DataStage Director we can find the errors in a job sequence.
I want to process 3 files sequentially, one by one. How can I do that? While processing, it should fetch the files automatically.
Added on Thu, Dec 3, 2009
If the metadata for all the files is the same, then create a job with the file name as a parameter, then use the same job in a routine and call the job with different file names... or you can create a sequencer to use the job...
What’s the difference between an operational data store (ODS) and a data warehouse?
Added on Thu, Dec 3, 2009
A data warehouse is a decision support database for organisational needs. It is a subject-oriented, non-volatile, integrated, time-variant collection of data. An ODS (operational data store) is an integrated collection of related information. It contains...
How to handle date conversions in DataStage? Convert a mm/dd/yyyy format to yyyy-dd-mm?
Added on Thu, Dec 3, 2009
Here is the right conversion: the function to convert mm/dd/yyyy format to yyyy-dd-mm is Oconv(Iconv(Fieldname,"D/MDY[2,2,4]"),"D-YDM[4,2,2]").
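For readers who want to check the expected result outside DataStage, the same reformatting can be sketched in Python. This is not the Iconv/Oconv implementation; it only reproduces the mm/dd/yyyy to yyyy-dd-mm mapping with the standard library, and the function name is hypothetical.

```python
from datetime import datetime


def mdy_to_ydm(value):
    """Parse a mm/dd/yyyy string and re-emit it as yyyy-dd-mm."""
    parsed = datetime.strptime(value, "%m/%d/%Y")
    return parsed.strftime("%Y-%d-%m")


print(mdy_to_ydm("12/03/2009"))
```

Parsing to a date first, as the nested Iconv/Oconv call does, means an invalid input such as month 13 is rejected instead of being silently rearranged.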
What is the max capacity of Hash file in DataStage?
Added on Thu, Dec 3, 2009
Take a look at the uvconfig file:
# 64BIT_FILES - This sets the default mode used to
# create static hashed and dynamic files.
# A value of 0 results in the creation of 32-bit
# files. 32-bit files have a maximum file size of
# 2 gigabytes....
How can I convert server jobs into parallel jobs?
Added on Thu, Dec 3, 2009
I have never tried doing this; however, I have some information that will save you a lot of time. You can convert your server job into a server shared container. The server shared container can also be used in parallel jobs as shared...
Is it possible to call one job from another job in server jobs?
Added on Thu, Dec 3, 2009
I think we can call one job from another. In fact, "calling" is not quite the right word, because you attach/add the other job through the job properties. You can attach zero or more jobs. The steps are Edit --> Job Properties --> Job...
What is the difference between a hashed file and a sequential file?
Added on Thu, Dec 3, 2009
A hashed file stores data based on a hash algorithm and a key value. A sequential file is just a file with no key column. A hashed file can be used as a reference for lookups; a sequential file cannot.
How will you call an external function or subroutine from DataStage?
Added on Thu, Dec 3, 2009
DataStage has an option to call external programs: ExecSH.
Do you know about the Integrity/Quality stage?
Added on Thu, Dec 3, 2009
Integrity/Quality Stage is a data integration tool from Ascential that is used to standardize/integrate data from different sources.
What enhancements were made in DataStage 7.5 compared with 7.0?
Added on Thu, Dec 3, 2009
Many new stages were introduced compared to DataStage version 7.0. In server jobs we have the Stored Procedure stage and the Command stage, and a generate report option was added in the File tab. In job sequences, many stages like Start Loop activity, End Loop...
How to kill a job in DataStage?
Added on Thu, Dec 3, 2009
By killing the respective process ID.
What are the Iconv and Oconv functions?
Added on Thu, Dec 3, 2009
Iconv() converts a string to an internal storage format. Oconv() converts an expression to an output format.
Dimensional modelling types along with their significance
Added on Thu, Dec 3, 2009
Data modeling: 1) E-R diagrams 2) Dimensional modeling 2.a) Logical modeling 2.b) Physical modeling
What are the different types of lookups in DataStage?
Added on Thu, Dec 3, 2009
There are two types of lookups: the Lookup stage and the Lookup File Set. Lookup: refers to another stage or database to get data from it and transform it for another database. Lookup File Set: it allows you to create a lookup file set or reference one for a...
What is meant by "Try to have the constraints in the ’Selection’ criteria of the jobs itself. This will eliminate the unnecessary records even getting in before joins are made"?
Added on Thu, Dec 3, 2009
This means trying to improve performance by avoiding constraints in the job wherever possible and instead applying them while selecting the data itself, using a WHERE clause. This improves performance.
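The idea of filtering at selection time can be illustrated with Python's built-in sqlite3 module. The `sales` table, its columns, and the threshold value are assumptions invented for the example; the sketch only shows the WHERE clause discarding rows before they ever reach downstream processing.

```python
import sqlite3


def make_demo_connection():
    """Create an in-memory database with a small, hypothetical sales table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE sales (id INTEGER PRIMARY KEY, amount REAL)")
    conn.executemany(
        "INSERT INTO sales (id, amount) VALUES (?, ?)",
        [(1, 5.0), (2, 50.0), (3, 500.0)],
    )
    return conn


def extract_filtered(conn, min_amount):
    """Apply the constraint in the source query, so unwanted rows are never extracted."""
    cursor = conn.execute(
        "SELECT id, amount FROM sales WHERE amount >= ? ORDER BY id",
        (min_amount,),
    )
    return cursor.fetchall()


conn = make_demo_connection()
print(extract_filtered(conn, 50))
```

The alternative, selecting every row and rejecting some later in a Transformer constraint, does the same logical work but moves all rows through the job first.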
What are the types of hashed file?
Added on Thu, Dec 3, 2009
Hashed files are classified broadly into 2 types. a) Static - subdivided into 17 types based on primary key pattern. b) Dynamic - subdivided into 2 types: i) Generic ii) Specific. The default hashed file is "Dynamic - Type 30".
What are Routines and where/how are they written and have you written any routines before?
Added on Thu, Dec 3, 2009
Routines are stored in the Routines branch of the DataStage Repository, where you can create, view, or edit them using the Routine dialog box. The following program components are classified as routines: Transform functions. These are...
How do we automate dsjobs?
Added on Thu, Dec 3, 2009
We can call a DataStage batch job from the command prompt using ’dsjob’. We can also pass all the parameters from the command prompt. Then call this shell script from any of the schedulers available on the market. The second option is to schedule these jobs...
How do you eliminate duplicate rows?
Added on Thu, Dec 3, 2009
Use the Remove Duplicates stage: it takes a single sorted data set as input, removes all duplicate records, and writes the results to an output data set.
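The behaviour described here (sorted input, one record kept per key) can be sketched in Python with `itertools.groupby`. This is an illustration rather than the stage's implementation; keeping the first record of each group and the sample field names are assumptions.

```python
from itertools import groupby
from operator import itemgetter


def remove_duplicates(sorted_rows, key_fields):
    """Keep the first row of each run of rows that share the same key values.

    The input must already be sorted on key_fields, because groupby only
    merges adjacent rows.
    """
    get_key = itemgetter(*key_fields)
    return [next(group) for _, group in groupby(sorted_rows, key=get_key)]


# Hypothetical input, already sorted on "id".
rows = [
    {"id": 1, "city": "New York"},
    {"id": 1, "city": "New York"},
    {"id": 2, "city": "Florida"},
    {"id": 3, "city": "Boston"},
    {"id": 3, "city": "Boston"},
]
print(remove_duplicates(rows, ["id"]))
```

The sorted-input requirement is what lets duplicates be dropped in a single pass without holding the whole data set in memory.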
How do you pass the parameter to the job sequence if the job is running at night?
Added on Thu, Dec 3, 2009
Two ways: 1. Set the default values of the parameters in the job sequencer and map these parameters to the job. 2. Run the job in the sequencer using the dsjob utility, where we can specify the values to be taken for each parameter.
How can I specify a filter command for processing data while defining sequential file output data?
Added on Thu, Dec 3, 2009
We have what are called the after-job subroutine and the before-job subroutine, with which we can execute Unix commands. Here we can use the sort command or the filter command.
What other performance tuning have you done in your last project to increase the performance of slowly running jobs?
Added on Thu, Dec 3, 2009
Minimise the usage of the Transformer (instead use Copy, Modify, Filter, Row Generator). Use SQL code while extracting the data. Handle the nulls. Minimise the warnings. Reduce the number of lookups in a job design. Use not more than 20 stages in...
What does the separation option in a static hashed file mean?
Added on Thu, Dec 3, 2009
The different hashing algorithms are designed to distribute records evenly among the groups of the file based on characters and their position in the record IDs. When a hashed file is created, Separation and Modulo respectively specify the group...
How do you fix the error "OCI has fetched truncated data" in DataStage?
Added on Thu, Dec 3, 2009
Can we use the Change Capture stage to get the truncated data? Members, please confirm.
Why do you use SQL LOADER or OCI STAGE?
Added on Thu, Dec 3, 2009
When the source data is enormous, or for bulk data, we can use OCI and SQL*Loader depending upon the source.
How to handle rejected rows in DataStage?
Added on Thu, Dec 3, 2009
We can handle rejected rows in two ways with the help of constraints in a Transformer. 1) By setting the Rejected cell where we write our constraints in the properties of the Transformer. 2) Use REJECTED in the expression editor of the...
How to implement slowly changing dimensions in DataStage?
Added on Thu, Dec 3, 2009
Slowly changing dimensions are a data warehousing (DWH) concept. DataStage is a tool for ETL purposes, not for slowly changing dimensions.
What is the difference between validated OK and compiled in DataStage?
Added on Thu, Dec 3, 2009
When we say "validating a job", we are talking about running the job in "check only" mode. The following checks are made: - Connections are made to the data sources or data warehouse. - SQL SELECT statements are prepared. - Files are opened....
What is the meaning of the following..
Added on Thu, Dec 3, 2009
Question: What is the meaning of the following.. 1) If an input file has an excessive number of rows and can be split up, then use standard logic to run jobs in parallel. 2) Tuning should occur on a job-by-job basis. Use the...
What about system variables?
Added on Thu, Dec 3, 2009
DataStage provides a set of variables containing useful system information that you can access from a transform or routine. System variables are read-only. @DATE: The internal date when the program started. See the Date function. @DAY: The...
What is SQL tuning? How do you do it?
Added on Thu, Dec 3, 2009
SQL tuning can be done using cost-based optimization. These pfile parameters are very important: sort_area_size, sort_area_retained_size, db_multi_block_count, open_cursors, cursor_sharing, optimizer_mode=choose/rule.
What is the flow of loading data into fact & dimensional tables?
Added on Thu, Dec 3, 2009
Here is the sequence for loading a data warehouse. 1. The source data is first loaded into the staging area, where data cleansing takes place. 2. The data from the staging area is then loaded into dimensions/lookups. 3. Finally, the fact tables are...
Types of views in DataStage Director?
Added on Thu, Dec 3, 2009
There are 3 types of views in DataStage Director: a) Job View - dates of jobs compiled. b) Log View - status of job last run. c) Status View - warning messages, event messages, program-generated messages.
What is job control? How can it be used? Explain with steps.
Added on Thu, Dec 3, 2009
JCL stands for Job Control Language. It is used to run a number of jobs at a time, with or without using loops. Steps: click Edit in the menu bar, select ’Job Properties’ and enter the parameters as (parameter, prompt, type): STEP_ID STEP_ID...
What is the difference between DataStage and Informatica?
Added on Thu, Dec 3, 2009
The main difference is the vendor. Each has advantages stemming from its architecture. DataStage takes a top-down approach. We have to choose products based on the business needs.
What are OConv () and Iconv () functions and where are they used?
Added on Thu, Dec 3, 2009
Iconv is used to convert a date into the internal format, i.e. one that only DataStage can understand. For example, for a date coming in mm/dd/yyyy format, DataStage will convert your date into some number like 740. You can use this 740 in a derivation in your own...
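The "date becomes a number" behaviour can be sketched in Python. The sketch assumes the internal date is a count of days with day 0 falling on 31 December 1967, which is the convention of the BASIC engine that DataStage server jobs run on; it is a stand-in for illustration, not the Iconv/Oconv functions themselves, and the function names are hypothetical.

```python
from datetime import date, datetime, timedelta

# Assumed epoch: day 0 of the internal date format is 31 December 1967.
EPOCH = date(1967, 12, 31)


def to_internal(text, fmt="%m/%d/%Y"):
    """Convert an external date string into an internal day number."""
    return (datetime.strptime(text, fmt).date() - EPOCH).days


def to_external(day_number, fmt="%Y-%m-%d"):
    """Convert an internal day number back into a formatted date string."""
    return (EPOCH + timedelta(days=day_number)).strftime(fmt)


print(to_internal("01/09/1970"))
print(to_external(740))
```

Because the internal value is a plain integer, date arithmetic such as adding seven days becomes ordinary addition before converting back to an output format.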
What is a project? Specify its various components?
Added on Thu, Dec 3, 2009
You always enter DataStage through a DataStage project. When you start a DataStage client you are prompted to connect to a project. Each project contains: DataStage jobs. Built-in components. These are predefined components used in a job. User...
What validations do you perform after creating jobs in Designer? What different types of errors did you face during loading, and how did you solve them?
Added on Thu, Dec 3, 2009
Check the parameters, check whether the input files exist, check whether the input tables exist, and also check user names, data source names, passwords and the like.
What is Version Control?
Added on Thu, Dec 3, 2009
Version Control: stores different versions of DS jobs; runs different versions of the same job; reverts to a previous version of a job; views version histories.
What is ’insert for update’ in DataStage?
Added on Thu, Dec 3, 2009
I think ’insert to update’ means that the updated value is inserted to maintain history.
How do you merge two files in DS?
Added on Thu, Dec 3, 2009
Either use the copy command as a before-job subroutine if the metadata of the 2 files is the same, or create a job to concatenate the 2 files into one if the metadata is different.
Tell me about the environment in your last projects
Added on Thu, Dec 3, 2009
Give the OS of the server and the OS of the client of your most recent project.
How many jobs have you created in your last project?
Added on Thu, Dec 3, 2009
100+ jobs every 6 months if you are in development; if you are in testing, 40 jobs every 6 months, although it need not be the same number for everybody.
How to run a job from the command prompt in Unix?
Added on Thu, Dec 3, 2009
Using the dsjob command with options: dsjob -run -jobstatus projectname jobname
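The command line from this answer can also be launched from a wrapper script, for example when a scheduler calls Python. The sketch below only assembles and runs the exact command shown above; it assumes `dsjob` is on the PATH of the calling user, and the helper function names are hypothetical.

```python
import subprocess


def build_dsjob_command(project, job):
    """Assemble the argument list for: dsjob -run -jobstatus <project> <job>."""
    return ["dsjob", "-run", "-jobstatus", project, job]


def run_job(project, job):
    """Run the job and return the exit code reported by the dsjob process.

    Assumes the dsjob executable is installed and on PATH.
    """
    completed = subprocess.run(build_dsjob_command(project, job))
    return completed.returncode


# Placeholder names taken from the answer above.
print(build_dsjob_command("projectname", "jobname"))
```

Passing the arguments as a list rather than one shell string avoids quoting problems when project or job names contain spaces.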
How to parameterize a field in a sequential file? I am using DataStage as the ETL tool, with a sequential file as the source.
Added on Thu, Dec 3, 2009
We cannot parameterize a particular field in a sequential file; instead, we can parameterize the source file name in a sequential file.
What are the Steps involved in development of a job in DataStage?
Added on Thu, Dec 3, 2009
The steps required are: select the data source stage depending upon the sources, e.g. flat file, database, XML etc.; select the required stages for transformation logic, such as Transformer, Link Collector, Link Partitioner, Aggregator, Merge etc.; select...
What is the meaning of file extender in DataStage server jobs? Can we run a DataStage job from another job? Where is that file data stored, and what is the file extender in DS jobs?
Added on Thu, Dec 3, 2009
File extender means adding columns or records to an already existing file. In DataStage, we can run one DataStage job from another job.
How does the hashed file do lookups in server jobs? How does it compare the key values?
Added on Thu, Dec 3, 2009
A hashed file is used for two purposes: 1. Removing duplicate records. 2. Reference lookups. A hashed file record contains 3 parts: hashed key, key header and data portion. By using the hashing algorithm and the key value, the lookup is...
Can anyone tell me how to extract data from more than one heterogeneous source, for example a sequential file, Sybase and Oracle, in a single job?
Added on Thu, Dec 3, 2009
Yes, you can extract data from two heterogeneous sources in DataStage using the Transformer stage. It’s simple: you just need to form a link between the two sources in the Transformer stage.
What is NLS in DataStage? How do we use NLS in DataStage?
Added on Thu, Dec 3, 2009
Question: What is NLS in DataStage? How do we use NLS in DataStage? What are its advantages? At installation time I did not choose the NLS option; now I want to use it. What can I do? Reinstall DataStage, or first...
Default nodes for DataStage Parallel Edition
Added on Thu, Dec 3, 2009
Actually, the number of nodes depends on the number of processors in your system. If your system supports two processors, we will get two nodes by default.
What is DS Manager used for? Did you use it?
Added on Thu, Dec 3, 2009
The Manager is a graphical tool that enables you to view and manage the contents of the DataStage Repository.
How do you pass a file name as a parameter to a job?
Added on Thu, Dec 3, 2009
1. Go to DataStage Administrator->Projects->Properties->Environment->UserDefined. Here you can see a grid, where you can enter your parameter name and the corresponding path of the file. 2. Go to the Stage tab of the job, select the...
What are job parameters?
Added on Thu, Dec 3, 2009
These parameters are used to provide administrative access and change run-time values of the job. EDIT > JOB PARAMETERS: in the Parameters tab we can define the name, prompt, type and value.
What are XML files, how do you read data from XML files, and what stage should be used?
Added on Thu, Dec 3, 2009
In the palette there are Real Time stages like XML Input, XML Output and XML Transformer.
What are the difficulties faced in using DataStage? Or what are the constraints in using DataStage?
Added on Thu, Dec 3, 2009
1) What if the number of lookups is large? 2) What will happen if the job aborts while loading the data for some reason?
What are Stage Variables, Derivations and Constants?
Added on Thu, Dec 3, 2009
Stage Variable - An intermediate processing variable that retains its value during read and doesn't pass the value into the target column. Derivation - Expression that specifies the value to be passed on to the target column. Constant - Conditions that are...
What are the data file and descriptor file in the Data Set stage? Please explain briefly.
Added on Thu, Dec 3, 2009
No answer available currently.
What are modulus and splitting in a dynamic hashed file?
Added on Thu, Dec 3, 2009
The modulus size can be increased by contacting your Unix Admin.
What are the command line functions that import and export the DS jobs?
Added on Thu, Dec 3, 2009
A. dsimport.exe - imports the DataStage components. B. dsexport.exe - exports the DataStage components.
How do you do usage analysis in DataStage?
Added on Thu, Dec 3, 2009
1. If you want to know whether some job is part of a sequence, then in the Manager right-click the job and select Usage Analysis. It will show all the job's dependents. 2. To find how many jobs are using a particular table. 3. To find how many jobs are...
Purpose of using the key and difference between Surrogate keys and natural key
Added on Thu, Dec 3, 2009
We use keys to provide relationships between the entities (tables). By using primary and foreign key relationships, we can maintain the integrity of the data. The natural key is the one coming from the OLTP system. The surrogate key is the artificial key...
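The distinction between the two kinds of key can be sketched in a few lines of Python. The natural keys below are made-up OLTP identifiers, and assigning consecutive integers starting at 1 is just one common convention assumed for the example.

```python
def assign_surrogate_keys(natural_keys, start=1):
    """Map each distinct natural key to an artificial, sequential integer key."""
    mapping = {}
    for natural_key in natural_keys:
        if natural_key not in mapping:
            # The next surrogate key is the start value plus the number already issued.
            mapping[natural_key] = start + len(mapping)
    return mapping


# Hypothetical natural keys arriving from an OLTP system, with one repeat.
incoming = ["CUST-NY-0042", "CUST-FL-0007", "CUST-NY-0042"]
print(assign_surrogate_keys(incoming))
```

The surrogate key carries no business meaning, so it stays stable even if the source system later changes the format of its own identifiers.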
How to remove duplicates in a server job?
Added on Thu, Dec 3, 2009
1) Use a Hashed File stage, or 2) if you use the sort command in UNIX (before-job subroutine), you can reject duplicated records using the -u parameter, or 3) use a Sort stage.
What is the difference between Datastage and Datastage TX?
Added on Thu, Dec 3, 2009
It's a difficult question to answer, but one thing I can tell you is that DataStage TX is not an ETL tool, and it is not a new version of DataStage 7.5. TX is used for ODS sources; this much I know.
How can we ETL an Excel file to a data mart?
Added on Thu, Dec 3, 2009
Open the ODBC Data Source Administrator found in Control Panel/Administrative Tools. Under the System DSN tab, add the Microsoft Excel driver. Then you’ll be able to access the XLS file from DataStage.
How can we join an Oracle source and a sequential file?
Added on Thu, Dec 3, 2009
The Join and Lookup stages can be used to join an Oracle source and a sequential file.
Do you know about MetaStage?
Added on Thu, Dec 3, 2009
MetaStage is used to handle metadata, which is very useful for data lineage and data analysis later on. Metadata defines the type of data we are handling. These data definitions are stored in the repository and can be accessed with the use of...
What are static hashed files and dynamic hashed files?
Added on Thu, Dec 3, 2009
Hashed files have the default size established by their modulus and separation when you create them, and this can be static or dynamic. Overflow space is only used when data grows over the reserved size for one of the groups (sectors) within...
How do you track performance statistics and enhance it?
Added on Thu, Dec 3, 2009
Through the Monitor we can view the performance statistics.
My requirement is like this :
Added on Thu, Dec 3, 2009
Question: My requirement is like this: Here is the codification suggested: SALE_HEADER_XXXXX_YYYYMMDD.PSV SALE_LINE_XXXXX_YYYYMMDD.PSV XXXXX = LVM sequence to ensure uniqueness and continuity of file exchanges. Caution, there will...
Where do we use the Link Partitioner in a DataStage job? Explain with an example.
Added on Thu, Dec 3, 2009
We use the Link Partitioner in DataStage server jobs. The Link Partitioner stage is an active stage that takes one input and allows you to distribute partitioned rows to up to 64 output links.
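Since the question asks for an example, here is a minimal Python sketch of one input being spread over several outputs. Round-robin is assumed as the partitioning rule purely for illustration (the answer does not say which rule is applied), the function name is hypothetical, and the 64-link ceiling is taken from the answer above.

```python
MAX_OUTPUT_LINKS = 64  # upper limit quoted in the answer above


def partition_rows(rows, link_count):
    """Distribute rows from one input across link_count outputs in round-robin order."""
    if not 1 <= link_count <= MAX_OUTPUT_LINKS:
        raise ValueError("link_count must be between 1 and 64")
    links = [[] for _ in range(link_count)]
    for index, row in enumerate(rows):
        links[index % link_count].append(row)
    return links


print(partition_rows(["r1", "r2", "r3", "r4", "r5"], 2))
```

Each output list could then be processed independently, which is the reason for partitioning the input in the first place.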
Is it possible for two users to access the same job at the same time in DataStage?
Added on Thu, Dec 3, 2009
No, it is not possible for two users to access the same job at the same time. DS will produce the following error: "Job is accessed by other user".
What is job control? How is it developed? Explain with steps.
Added on Thu, Dec 3, 2009
Controlling DataStage jobs through other DataStage jobs. Ex: Consider two jobs, XXX and YYY. Job YYY can be executed from job XXX by using DataStage macros in routines. To execute one job from another job, the following steps need to be followed...
How to implement Type 2 slowly changing dimensions in DataStage? Give an example.
Added on Thu, Dec 3, 2009
Slowly changing dimensions are a common problem in data warehousing. For example: there is a customer called Lisa in company ABC, and she lives in New York. Later she moved to Florida. The company must modify her address now. In general, 3 ways...
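The Lisa example can be turned into a small Python sketch of the Type 2 approach, in which the old row is kept as history and a new current row is added. It is not a DataStage job design: the column names (`surrogate_key`, `start_date`, `end_date`, `is_current`), the dates, and the in-memory list standing in for the dimension table are all assumptions made for illustration.

```python
from datetime import date


def apply_type2_change(dimension, customer_id, new_state, change_date):
    """Expire the customer's current row, if any, and append a new current row."""
    current = next(
        (row for row in dimension
         if row["customer_id"] == customer_id and row["is_current"]),
        None,
    )
    if current is not None:
        if current["state"] == new_state:
            return dimension  # no change, so history stays as it is
        current["end_date"] = change_date
        current["is_current"] = False
    next_key = max((row["surrogate_key"] for row in dimension), default=0) + 1
    dimension.append({
        "surrogate_key": next_key,
        "customer_id": customer_id,
        "state": new_state,
        "start_date": change_date,
        "end_date": None,
        "is_current": True,
    })
    return dimension


# Hypothetical starting point: Lisa lives in New York.
dimension = [{
    "surrogate_key": 1, "customer_id": "lisa", "state": "New York",
    "start_date": date(2005, 1, 1), "end_date": None, "is_current": True,
}]
apply_type2_change(dimension, "lisa", "Florida", date(2009, 12, 3))
for row in dimension:
    print(row["surrogate_key"], row["state"], row["is_current"])
```

After the call the dimension holds two rows for Lisa, so facts recorded while she lived in New York still join to the New York row through its surrogate key.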
What is the difference between the DRS and ODBC stages?
Added on Thu, Dec 3, 2009
To answer your question, the DRS stage should be faster than the ODBC stage, as it uses native database connectivity. You will need to install and configure the required database clients on your DataStage server for it to work. Dynamic Relational...
Is it possible to move data from an Oracle warehouse to an SAP warehouse using DataStage?
Added on Thu, Dec 3, 2009
We can use the DataStage Extract Pack for SAP R/3 and the DataStage Load Pack for SAP BW to transfer data from Oracle to an SAP warehouse. These plug-in packs are available with DataStage version 7.5.
What is a data set? And what is a file set?
Added on Thu, Dec 3, 2009
File set: It allows you to read data from or write data to a file set. The stage can have a single input link, a single output link, and a single rejects link. It only executes in parallel mode. The data files and the file that lists them are called a...
What are the differences between DataStage 7.0 and 7.5 in server jobs?
Added on Thu, Dec 3, 2009
There are a lot of differences. Many new stages are available in DS 7.5, e.g. the CDC stage, the Stored Procedure stage, etc.
Is it possible to run parallel jobs in server jobs?
Added on Thu, Dec 3, 2009
No, it is not possible to run parallel jobs in server jobs. But server jobs can be executed in parallel jobs.
What is the User Variables activity? When, how, and where is it used? Give a real example.
Added on Thu, Dec 3, 2009
Using the User Variables activity, we can create variables in a job sequence; these variables are available to all the activities in that sequence. This activity is usually placed at the start of the job sequence. Read More
How can you implement slowly changing dimensions in DataStage? Explain.
Added on Thu, Dec 3, 2009
Question: 1) How can you implement slowly changing dimensions in DataStage? Explain. 2) Can you join a flat file and a database in DataStage? How? Answers: Yes, we can do it in an indirect way. First create a job which can populate the data... Read More
What is merge, and how do you use it?
Added on Thu, Dec 3, 2009
Merge is a stage that is available in both parallel and server jobs. The Merge stage is used to join two tables (server/parallel) or two tables/datasets (parallel). Merge requires the master table/dataset and the update table/dataset to be... Read More
What are the repository tables in DataStage?
Added on Thu, Dec 3, 2009
A data warehouse is a repository (centralized as well as distributed) of data, able to answer any ad hoc, analytical, historical, or complex queries. Metadata is data about data. Examples of metadata include data element descriptions, data type descriptions... Read More
What is DS Director used for? Did you use it?
Added on Thu, Dec 3, 2009
DataStage Director is a GUI to monitor, run, validate, and schedule DataStage server jobs. Read More
What is DS Designer used for? Did you use it?
Added on Thu, Dec 3, 2009
You use the Designer to build jobs by creating a visual design that models the flow and transformation of data from the data source through to the target warehouse. The Designer graphical interface lets you select stage icons, drop them onto the... Read More
Dimensional modelling is further subdivided into 2 types.
Added on Thu, Dec 3, 2009
a) Star schema: simple and much faster; denormalized form. b) Snowflake schema: complex, with more granularity; more normalized form. Read More
Did you work in UNIX environment?
Added on Thu, Dec 3, 2009
Sometimes you need to write UNIX programs that run in the background, such as batch programs, because DataStage can invoke batch processing every 24 hours. So UNIX knowledge is a must, so that we can run UNIX programs in the background at intervals of minutes or hours. Read More
What is the default cache size? How do you change the cache size if needed?
Added on Thu, Dec 3, 2009
The default read cache size is 128 MB. We can increase it by going into DataStage Administrator, selecting the Tunables tab, and specifying the cache size there. Read More
What does a Config File in parallel extender consist of?
Added on Thu, Dec 3, 2009
The config file consists of the following: a) the number of processes or nodes; b) the actual disk storage location. Read More
How large would the database be in DataStage? What is the difference between in-process and inter-process?
Added on Thu, Dec 3, 2009
In-process: You can improve the performance of most DataStage jobs by turning in-process row buffering on and recompiling the job. This allows connected active stages to pass data via buffers rather than row by row. Note: You cannot use in-process... Read More
How do you populate source files?
Added on Thu, Dec 3, 2009
There are many ways to populate them; writing a SQL statement in Oracle is one way. Read More
Types of Parallel Processing?
Added on Thu, Dec 3, 2009
Parallel processing is broadly classified into 2 types: a) SMP - Symmetric Multiprocessing. b) MPP - Massively Parallel Processing. Read More
What are the most important aspects that a beginner must consider when doing his first DS project?
Added on Thu, Dec 3, 2009
He should be good at data warehousing concepts and should be familiar with all stages. Read More
Where are flat files actually stored? What is the path?
Added on Thu, Dec 3, 2009
Flat files store the data, and the path can be given in the General tab of the Sequential File stage. Read More
How do you use Rank and Update Strategy in DataStage?
Added on Thu, Dec 3, 2009
Don't mix up Informatica with DataStage. In DataStage, we don't have such stages. Read More
What about System variables?
Added on Thu, Dec 3, 2009
Question: 1. What about system variables? 2. How can we create containers? 3. How can we improve the performance of DataStage? 4. What are the job parameters? 5. What is the difference between a routine, a transform, and a function? ... Read More
If I add a new environment variable in Windows, how can I access it in DataStage?
Added on Thu, Dec 3, 2009
You can view all the environment variables in Designer. You can check them in Job Properties. You can add and access the environment variables from Job Properties. Read More
DataStage from Staging to MDW is only running at 1 row per second! What do we do to remedy?
Added on Thu, Dec 3, 2009
I am assuming that there are too many stages, which is causing the problem, and providing a solution on that basis. In general, if you have too many stages (especially Transformers and hash lookups), there would be a lot of overhead and the performance would degrade... Read More
Can we use shared container as lookup in datastage server jobs?
Added on Thu, Dec 3, 2009
I am using DataStage 7.5 on UNIX. We can use a shared container more than once in a job. Is there any limit on its use? I ask because in my job I used the shared container in 6 flows, and at any time only 2 flows are working. Can you please share the info... Read More
If you're running 4-way parallel and you have 10 stages on the canvas, how many processes does DataStage create?
Added on Thu, Dec 3, 2009
The answer is 40. You have 10 stages, and each stage can be partitioned and run on 4 nodes, which makes the total number of processes generated 10 x 4 = 40. Read More
What is the OCI? and how to use the ETL Tools?
Added on Thu, Dec 3, 2009
OCI does not mean Orabulk. It uses Oracle's "Oracle Call Interface" to load the data. It is roughly the lowest level of Oracle used for loading data. Read More
Importance of a surrogate key in data warehousing?
Added on Thu, Dec 3, 2009
The concept of a surrogate key comes into play when there is a slowly changing dimension in a table. In such a condition there is a need for a key by which we can identify the changes made in the dimensions. These slowly changing dimensions can be of three... Read More
How do you eliminate duplicate rows?
Added on Thu, Dec 3, 2009
Duplicates can be eliminated by loading the corresponding data into a hash file. Specify the columns on which you want to eliminate duplicates as the keys of the hash file. Read More
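The keyed-overwrite idea behind this answer can be sketched in Python, assuming rows are dicts and using an ordinary dictionary as a stand-in for the hash file. The helper `dedupe_by_key` and the sample columns are hypothetical; the point is only that writing a row whose key already exists replaces the earlier row, so one row per key survives.

```python
def dedupe_by_key(rows, key_columns):
    """Keep one row per key; a later row overwrites an earlier one."""
    hashed = {}
    for row in rows:
        key = tuple(row[col] for col in key_columns)
        hashed[key] = row  # same key -> previous row is replaced
    return list(hashed.values())

rows = [
    {"id": 1, "name": "A"},
    {"id": 2, "name": "B"},
    {"id": 1, "name": "C"},  # duplicate key, replaces the first row
]
unique = dedupe_by_key(rows, ["id"])
```

Which duplicate survives depends on arrival order, so sort the input first if a particular version of the row must be kept.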
What is DS Administrator used for? Did you use it?
Added on Thu, Dec 3, 2009
The Administrator enables you to set up DataStage users, control the purging of the Repository, and, if National Language Support (NLS) is enabled, install and manage maps and locales. Read More
How can we create Containers?
Added on Thu, Dec 3, 2009
There are two types of containers: 1. Local container 2. Shared container. A local container is available for that particular job only, whereas shared containers can be used anywhere in the project. Local container: Step 1: Select the stages required... Read More
What is the utility you use to schedule the jobs on a UNIX server other than using Ascential Director?
Added on Thu, Dec 3, 2009
"AUTOSYS": Through AutoSys you can automate the job by invoking the shell script written to schedule the DataStage jobs. Read More
What will you do in a situation where somebody wants to send you a file, and you must use that file as an input or reference and then run the job?
Added on Thu, Dec 3, 2009
A. Under Windows: Use the ’WaitForFileActivity’ under the Sequencers and then run the job. Maybe you can schedule the sequencer around the time the file is expected to arrive. B. Under UNIX: Poll for the file. Once the file has start... Read More
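The "poll for the file" step can be sketched as a small Python loop. This is only an illustration under stated assumptions: the helper name `wait_for_file`, the timeout, and the polling interval are all hypothetical, and the sketch checks only that the file exists, not that the sender has finished writing it.

```python
import os
import time

def wait_for_file(path, timeout_seconds=3600, poll_seconds=30):
    """Return True once `path` exists, or False if the timeout expires."""
    deadline = time.monotonic() + timeout_seconds
    while True:
        if os.path.exists(path):
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(poll_seconds)  # wait before checking again
```

A wrapper script could call `wait_for_file` with the expected path and start the job only when it returns `True`.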
Explain the differences between Oracle8i/9i?
Added on Thu, Dec 3, 2009
Multiprocessing; databases; more dimensional modeling. Read More
How good are you with your PL/SQL?
Added on Thu, Dec 3, 2009
You will not be writing PL/SQL in DataStage; SQL knowledge is enough... Read More
What other ETL tools have you worked with?
Added on Thu, Dec 3, 2009
Ab Initio, DataStage EE (parallel edition), and Oracle ETL; there are 7 ETL tools in the market. Read More
What is Hash file stage and what is it used for?
Added on Thu, Dec 3, 2009
We can also use the Hash File stage to avoid or remove duplicate rows by specifying the hash key on a particular field. Read More
What are environment variables, and what are they used for?
Added on Thu, Dec 3, 2009
Basically, an environment variable is a predefined variable that we can use while creating a DS job. We can set it at either project level or job level. Once we set a specific variable, it is available to the project/job. We can also define new... Read More
Functionality of Link Partitioner and Link Collector?
Added on Thu, Dec 3, 2009
Server jobs mainly execute in sequential fashion; the IPC stage, as well as the Link Partitioner and Link Collector, simulate the parallel mode of execution in server jobs on a single CPU. Link Partitioner: It receives data on a single... Read More
Containers : Usage and Types?
Added on Thu, Dec 3, 2009
A container is a collection of stages used for the purpose of reusability. There are 2 types of containers: a) Local container: job specific. b) Shared container: used in any job within a project. There are two types of shared container: 1. Server... Read More
What is the purpose of the Exception activity in DataStage 7.5?
Added on Thu, Dec 3, 2009
The stages that follow the Exception activity are executed whenever an unknown error occurs while the job sequence is running. Read More
What are constraints and derivation?
Added on Thu, Dec 3, 2009
Question: * What are constraints and derivations? * Explain the process of taking a backup in DataStage. * What are the different types of lookups available in DataStage? Answers: Constraints are used to check for a condition and... Read More
What are the Orabulk and BCP stages?
Added on Thu, Dec 3, 2009
ORABULK is used to load bulk data into a single table of a target Oracle database. BCP is used to load bulk data into a single table for Microsoft SQL Server and Sybase. Read More
Does Enterprise Edition only add the parallel processing for better performance? Are any stages/transformations available in the enterprise edition only?
Added on Thu, Dec 3, 2009
DataStage Standard Edition was previously called DataStage and DataStage Server Edition. DataStage Enterprise Edition was originally called Orchestrate, then renamed to Parallel Extender when purchased by Ascential. DataStage Enterprise: Server... Read More
What is the difference between server jobs and parallel jobs?
Added on Thu, Dec 3, 2009
Here is the difference. Server jobs: these are available if you have installed DataStage Server. They run on the DataStage Server, connecting to other data sources as necessary. Parallel jobs: these are only available if you have installed ... Read More
What is merge, and how can it be done? Please explain with a simple example using 2 tables.
Added on Thu, Dec 3, 2009
Merge is used to join two tables. It takes the key columns and sorts them in ascending or descending order. Let us consider two tables, Emp and Dept. If we want to join these two tables, they have DeptNo as a common key, so we can give that column name as the key... Read More
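The sort-then-join idea for the Emp and Dept example can be sketched in Python as follows. This is an illustration of a generic sorted merge join, not the Merge stage's actual implementation; the function `merge_join`, the sample rows, and the extra columns `EmpName` and `DeptName` are hypothetical, and the sketch assumes each DeptNo appears at most once in Dept.

```python
def merge_join(emp_rows, dept_rows, key="DeptNo"):
    """Sort both inputs on `key`, then join them in a single pass.

    Assumes each key appears at most once in `dept_rows`.
    Emp rows without a matching Dept row are dropped.
    """
    emps = sorted(emp_rows, key=lambda r: r[key])
    depts = sorted(dept_rows, key=lambda r: r[key])
    joined, j = [], 0
    for emp in emps:
        # advance the Dept cursor until it reaches the Emp key
        while j < len(depts) and depts[j][key] < emp[key]:
            j += 1
        if j < len(depts) and depts[j][key] == emp[key]:
            joined.append({**emp, **depts[j]})
    return joined

emp = [{"EmpName": "Ravi", "DeptNo": 20}, {"EmpName": "Asha", "DeptNo": 10}]
dept = [{"DeptNo": 10, "DeptName": "Sales"}, {"DeptNo": 20, "DeptName": "HR"}]
result = merge_join(emp, dept)
```

Because both inputs are sorted on DeptNo, each is read only once, which is why the key columns must be sorted the same way on both sides.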
What happens if the output of a hash file is connected to a Transformer? What error does it throw?
Added on Thu, Dec 3, 2009
If the hash file output is connected to a Transformer stage, the hash file is treated as the lookup file if there is a primary link to the same Transformer stage; if there is no primary link, it is treated as the primary link itself. You can do SCD in... Read More
How can we pass parameters to a job by using a file?
Added on Thu, Dec 3, 2009
You can do this by passing parameters from a UNIX file and then calling the execution of a DataStage job. The DS job has the parameters defined (which are passed by UNIX). Read More
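One way this could look is sketched below in Python, assuming the file holds one `name=value` pair per line and the job is started with the `dsjob` command-line client using its `-run` and `-param` options. The helper names, the file layout, and the project, job, and parameter names are all hypothetical; the sketch only builds the command and does not run it.

```python
def parse_params(lines):
    """Read name=value pairs, skipping blank lines and # comments."""
    params = {}
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        name, _, value = line.partition("=")
        params[name.strip()] = value.strip()
    return params

def build_dsjob_command(project, job, params):
    """Build a dsjob command line that passes each pair with -param."""
    command = ["dsjob", "-run"]
    for name, value in params.items():
        command += ["-param", f"{name}={value}"]
    return command + [project, job]

# Hypothetical parameter file contents, one pair per line.
sample = ["# load settings\n", "SRC_DIR=/data/in\n", "RUN_DATE = 2009-12-03\n"]
command = build_dsjob_command("MyProject", "MyJob", parse_params(sample))
```

In practice `parse_params` would be given an open file handle, and the resulting list could be passed to `subprocess.run` on a machine where `dsjob` is installed.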
How can we implement Lookup in DataStage Server jobs?
Added on Thu, Dec 3, 2009
The DB2 stage can be used for lookups. In the Enterprise Edition, the Lookup stage can be used for doing lookups. Read More
What are Sequencers?
Added on Thu, Dec 3, 2009
A sequencer allows you to synchronize the control flow of multiple activities in a job sequence. It can have multiple input triggers as well as multiple output triggers. The sequencer operates in two modes: ALL mode. In this mode all of the inputs to... Read More
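The two modes can be illustrated with a short Python sketch. The excerpt names only ALL mode before it is cut off; the second mode is assumed here to be ANY (fire when at least one input has triggered), and the function `sequencer_fires` is a hypothetical model, not a DataStage API.

```python
def sequencer_fires(input_triggers, mode="ALL"):
    """Decide whether a sequencer passes control on.

    input_triggers: list of booleans, one per input link,
    True when that input has triggered.
    """
    if mode == "ALL":
        return all(input_triggers)   # every input must have triggered
    if mode == "ANY":
        return any(input_triggers)   # assumed second mode: one is enough
    raise ValueError(f"unknown mode: {mode}")
```

For example, `sequencer_fires([True, False], "ALL")` is `False`, while the same inputs in `"ANY"` mode give `True`.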
Did you parameterize the job or hard-code the values in the jobs?
Added on Thu, Dec 3, 2009
Always parameterize the job. The values come either from Job Properties or from a "Parameter Manager", a third-party tool. There is no way you will hard-code some parameters in your jobs. The often parameterized variables in a job are: DB DSN... Read More
Differentiate Database data and Data warehouse data?
Added on Thu, Dec 3, 2009
By Database, one means OLTP (On Line Transaction Processing). This can be the source systems or the ODS (Operational Data Store), which contains the transactional data. Read More
Which partitioning method is best for stage performance in Enterprise Edition?
Added on Thu, Dec 3, 2009
It mainly depends on the volume of data in the source, whereas for lookups we prefer Entire partitioning. So ultimately it depends on the stage we are using, the size of the data, and the requirements. Read More
Briefly describe the various client components?
Added on Thu, Dec 3, 2009
There are four client components. DataStage Designer: a design interface used to create DataStage applications (known as jobs). Each job specifies the data sources, the transforms required, and the destination of the data. Jobs are compiled to... Read More
How does DataStage handle user security?
Added on Thu, Dec 3, 2009
We have to create users in the Administrator and give them the necessary privileges. Read More
What is the difference between DNS and DHCP? What is the file system of Windows? Compare NIS with DNS.
Added on Thu, Dec 3, 2009
No answer available currently. Be the first one to reply to this question by submitting your answer from the form below. Read More
|