Life     Entertainment

Wednesday, May 31, 2006

DataStage

Sample DataStage Certification Questions:

1. Which DataStage EE client application is used to manage roles for DataStage projects?
A. Director
B. Manager
C. Designer
D. Administrator
2. Importing metadata from data modeling tools like ERwin is accomplished by which facility?
A. MetaMerge
B. MetaExtract
C. MetaBrokers
D. MetaMappers
3. Which two statements are true of writing intermediate results between parallel jobs to persistent data sets? (Choose two.)
A. Datasets are pre-indexed.
B. Datasets are stored in native internal format.
C. Datasets retain data partitioning and sort order.
D. Datasets can only use RCP when a schema file is specified.
4. You are reading customer data using a Sequential File stage and sorting it by customer ID using the Sort stage. Then the sorted data is to be sent to an Aggregator stage which will count the number of records for each customer.
Which partitioning method is more likely to yield optimal performance without violating the business requirements?

A. Entire
B. Random
C. Round Robin
D. Hash by customer ID
5. A customer wants to create a parallel job to append to an existing Teradata table with an input file of over 30 gigabytes. The input data also needs to be transformed and combined with two additional flat files. The first has State codes and is about 1 gigabyte in size. The second file is a complete view of the current data which is roughly 40 gigabytes in size. Each of these files will have a one to one match and ultimately be combined into the original file.
Which DataStage stage will communicate with Teradata using the maximum parallel performance to write the results to an existing Teradata table?

A. Teradata API
B. Teradata Enterprise
C. Teradata TPump
D. Teradata MultiLoad
6. Which column attribute could you use to avoid rejection of a record with a NULL when it is written to a nullable field in a target Sequential File?
A. null field value
B. bytes to skip
C. out format
D. pad char
7. You are reading customer records from a sequential file. In addition to the customer ID, each record has a field named Rep ID that contains the ID of the company representative assigned to the customer. When this field is blank, you want to retrieve the customers representative from the REP table.
Which stage has this functionality?
A. Join Stage
B. Merge Stage
C. Lookup Stage
D. No stage has this functionality.
8. You want to ensure that you package all the jobs that are used in a Job Sequence for deployment to a production server.
Which command line interface utility will let you search for jobs that are used in a specified Job Sequence?

A. dsjob
B. dsinfo
C. dsadmin
D. dssearch
9. Your job is running in a grid environment consisting of 50 computers each having two processors. You need to add a job parameter that will allow you to run the job using different sets of resources and and computers on different job runs.
Which environment variable should you add to your job parameters?

A. APT_CONFIG_FILE
B. APT_DUMP_SCORE
C. APT_EXECUTION_MODE
D. APT_RECORD_COUNTS
10. Which two statements are valid about Job Templates? (Choose two.)
A. Job Templates can be created from any parallel job or Job Sequence.
B. Job Templates should include recommended environment variables including APT_CONFIG_FILE.
C. Job Templates are stored on the DataStage development server where they can be shared among developers.
D. The locatation where Job Templates are stored can be changed within DataStage Designer Tools - Options menu.
Answer Key:
1. D
2. C
3. B and C
4. D
5. B
6. A
7. C
8. D
9. A
10. A and B

0 Comments:

Post a Comment

<< Home