he Big Data & Integration Summit was a success and our presentations are now available to the public for viewing. http://ow.ly/q64hz
Pervasive has recently developed an effective utility for migrating Data Integrator v9 projects into Pervasive Data Integrator v10. The process is quick and relatively smooth; however, there is the potential for challenges to arise due to the complex nature of most DI projects. If you are thinking about transitioning from v9 to v10, please reach out to Emprise to learn how our team of Certified Pervasive Developers can help your transition to v10 be successful.
Emprise Technologies is proud to be a Platinum sponsor of Pervasive IntegrationWorld 2013. We are also sponsoring the Data Clinic. If you are going to be at IntegrationWorld, come by the Data Clinic and ask one of our Pervasive certified consultants questions about Data Integrator. Bring your toughest Data Integration questions: The Emprise team has collective 30,000 hours of Pervasive work, so we doubt you’ll be able to stump us. But we’re open to your trying! See you at IntegrationWorld 2013. We’ll be in the Hyatt Hill Country Ballroom A-C from 10:15 a.m. until 4:00 p.m. on Monday, April 15 and again on Tuesday, the 16, from 9:20 a.m. – 12:00 p.m.
Pervasive Data Integrator can be a powerful tool, enabling multiple connections between a wide variety of systems and data points. The Repository Explorer is the starting point for every Data Integrator project. To get started on your first project, you must understand how to configure your Workspaces and Repositories through Repository Explorer.
What is the Repository Explorer?
The Repository Explorer is the starting point for all Pervasive Data Integrator projects. From this one application, developers can navigate to projects, open existing Pervasive Data Integrator elements (Maps, Processes, Structured Schemas, etc), or create new instances of those elements.
Who uses the Repository Explorer?
The Repository Explorer is used almost exclusively by developers, but it can also be used by quality assurance resources to access and review code that has already been developed.
How to Configure Repository Explorer?
Opening Repository Explorer
Once installed, Repository Explorer can be accessed like any other program on Windows.
- Open the Start menu
- Select ‘All Programs’
- Select Pervasive folder
- Select Data Integrator 9 folder
- Select Repository Explorer 9 program
Setting up a Workspace
The Repository Explorer organizes files using two methods. The first method is via a Workspace. A Workspace is a collection of one ore many repositories and a single Macrodef.xml file that is specific to the Workspace. (Note: For further information on the Macrodef.xml file, check out our two videos on Macro Definition Variables). At Emprise, our best practice is to create one Workspace for each project. This allows us to specific a unique Macrodef file for each project.
To create a new Workspace, one just has to follow a few simple steps.
- Select File from the menu bar
- Select Manage Workspaces…
- Click the drop down to the right of Workspaces Root Directory and navigate to the location you would like to save your workspace in. Hit OK. The Workspace Root Directory is the location where the Workspace folder will be created. Inside of this folder a set of mandatory, default files and folders will be created.
- Xmldb – This is the folder used for the default repository when creating a new Workspace.
- Fileref.xml – A list of file references used by the workspace.
- Macrodef.xml – The macrodef file. For further information see our video.
- Repositores.xml – A list of all repositories for the Workspace. This directory is rarely changed after initial setup.
- Select Add
- To add an existing workspace, check the box for the proper workspace.
- If adding a new Workspace, which will often be the case, click the ‘Create New Workspace” button. You will be prompted for a name. Give your workspace a descriptive name and then click “OK” to return to the previous screen.
- Click OK
- Find the Workspace you just added, click the name to highlight it, then click the Set Current button on the right. This activates the Workspace, allowing you access to its repositories.
*Note: You can also right click on the white space to the left of the screen that displays your current repository and select the ‘Manage Workspaces’ option from there. Also, when creating a new Workspace using the “Create New Workspace” option, the Macrodef.xml file from the current Workspace will be copied into the directory of the new Workspace. This includes the Macro names and values. This is helpful when standard Macros are used, but is something to pay attention to in regards to directory paths and file names.
Setting up a Repository
Repositories are directory paths pointing to where the Pervasive DI project files will be located. A Workspace can have any number of Repositories, which are displayed in a tree view on the left side of the Repository Explorer. Only Pervasive files located in one of the Repositories for the Workspace can be opened and edited from the Repository Explorer.
- If you are not currently working in the workspace within which you want to create a repository, navigate to that workspace using File -> Manage Workspaces or right click on the white space on the left that displays the file directory structure and select Manage Workspaces. Then, click the text of the Workspace you would like to use to highlight it before clicking the Set Current button on the right.
- Select File form the menu bar
- Select Manage Repositories…
- Click the Add… button to create a new repository. At this point you can either navigate to the folder you would like to select as the Repository or paste in the file path as copied from Windows Explorer.
Note: You can also right click on the white space to the left of the screen that displays your current repository and select the ‘Manage Repositories’ option from there.
- Use a standard naming convention for the Workspaces. This allows for easy identification of what project the Workspace is for.
- Use a standard data structure and directory path for the Repository folder. This prevents issues that may arise when multiple developers work on the same code base. Pervasive Data Integrator projects use a series of pointers within the files and by standardizing the repository paths you prevent these pointers from being corrupted when moving code between developers.
- After creating a new Workspace it is prudent to open the Macrodef.XML file and remove unneeded Macros and change others to match your new project. Please see our video on the MacroDef for further information.
When using an Excel document as a source, the header row is used to determine the names of the fields for the source connector. In order to ensure the field names will be consistent, one can insert a row into the beginning of each document before it is processed. For example, today our client sent us an Excel document that contained the following header in column A: “Account Number”, but yesterday, the value in column A was: “Acct Num”.
Dynamically inserting a static header row into an Excel document allows for the processing of Excel documents regardless of whether or not they contain a consistent header row from the client.
This should be done when you are asked to process an Excel document that is missing a header row or does not have a consistent header row.
The use of consistent column headers is beneficial to the:
- Developer – Implements the code to add a header row to the Excel document prior to processing.
- End User – Is able to review and utilize the new data loaded into the system.
Inserting a header row to an Excel document is implemented using a RIFL step within Pervasive Data Integrator Process Designer.
Before the Excel document is processed, use a RIFL step to open the document and insert a static header row at the beginning of the document that matches the column names identified in the Map’s source schema. If the file may come in with or without a header, you can add a Source Filter to your Map that processes the Excel document. The filter can validate each row to filter out any extra or unwanted header rows and rows that only contain whitespace, which will allow you to process the data in the document successfully.
Subscribe to the Emprise Technologies YouTube channel to access our library of video demos.