SAP BusinessObjects XI 3.1 Demonstration Walkthrough Part 1
This week I presented at the Mastering BusinessObjects 2010 conference, hosted in Melbourne, Australia. The conference was organised by Eventful Management, who did a fantastic job. International presenters included Timo Elliott, Ingo Hilgefort, Dave Rathbun, Rajeev Kapur and Tom Nather. You can see some photos of the event by Timo here.
My topic was called ‘SAP BusinessObjects XI 3.1 End-to-End Demonstration’, in which I wanted to do a live demonstration of one dataset being used with most of the BOBJ tools, including Data Integrator, Universe Designer, Web Intelligence, Xcelsius, Live Office, QaaWS and Explorer.
Here is a link to the PowerPoint deck, though there’s not many slides.. (Fletcher_EndtoEndSBO_MBO)
I want to provide a walkthrough of the presentation, and include links to the datasets I used. This should hopefully allow anyone (who has access to the SAP BusinessObjects XI 3.1 environment) to follow my steps.
To begin with, I introduced the dataset I used. I downloaded it from http://data.australia.gov.au/, and it shows the Number of Crimes in New South Wales, Australia, broken down by Local Government Authority (LGA), Month/Year (over 12 years), Offence Category and Subcategory. All in all it represents over 1.7 million records, so not a small dataset. I also extended the dataset by introducing State Region (multiple LGA’s belong to a Region) in another CSV file.
To start with, the first part of the presentation was to use Data Integrator to import data from two CSV files into a SQL Server database, including de-normalising the data into a dimensional model. You can download the CSV files from the Datasets page of my blog, under the NSW Crimes heading. You will also need to create a database (I used SQL Server) to host the tables in the Target Datastore.
Here are the steps I following to build the ETL job to move the data from the CSV files to the SQL Server database.
- Create a new Flat File Format for the ‘NSW Crimes Data 95-08.csv’ (CSV#1) for extracting data
- Create a new Flat File Format for the ‘Regions and LGAs.csv’ (CSV#2) for extracting data
- Create a new database to store the extracted data in, and create a new Datastore (I called mine DEMO_E2E_SBO)
- Create a new Project and Job
- Within the new Job, create a new dataflow for staging the main source file, using a template table
- Run the job to stage the data
- Import the template table and set to overwrite
- Add a new dataflow for creating the DIM_DATE table, using the Date_Generation transform
- Set the date generation from 1995.01.01 to 2008.12.31, monthly
- Add the following output columns to the Query transform
- Date_SK (julian of Date)
- Year
- Quarter
- Month
- Date
- Add a new dataflow for DIM_LGA, using the CSV#1 flat file format as a source
- Add the following output columns to the Query transform
- LGA_SK (use gen_row_number function)
- State_Region (leave blank for now)
- Local_Government Authority
- Group by the LGA column
- Add in the CSV#2 as another source, combining in another Query transform, with the following changes
- Populate State_Region
- Make sure the WHERE clause includes LGA=LGA
- Add the following output columns to the Query transform
- Create a new dataflow for DIM_OFFENCE_TYPE, using the following columns in the Query transform
- Offence_Type_SK (use gen_row_num to populate)
- Offence Category
- Offence Subcategory
- Make sure to GROUP BY Offence Category and Subcategory
- Run the job to create and load the dimension tables, then import each of the dimension tables and set them to overwrite values
- Create a dataflow for loading the FACT_CRIMES
- Add in a pivot transform, using the following settings
- Pivot sequence column called ‘Seq_Number’
- Non-pivot Columns
- LGA
- Offence Category
- Offence Subcategory
- Pivot Columns
- All month columns
- Data field column called ‘Number_of_Crimes’
- Header column called’ Month’
- Add in a query transform, with settings as follows
- Add the following columns
- Crime _SK (use gen_row_num to populate)
- Date_SK – use this mapping = julian(to_date(’01 ‘ || Pivot.Month,’DD MON YYYY’))
- LGA_SK – use lookup_ext function to populate with the LGA_SK from the DIM_LGA table
- Offence_Type_SK – use lookup_ext function to populate with the Offence_Type_SK from the DIM_OFFENCE_TYPE table
- Number_of_Crimes – direct mapping
- Add the following columns
- Run the job
- Import fact table and set to overwrite, and you’re done!
I’ve also exported the entire Target Datastore to an Access DB (also available on the the Datasets page of my blog), as well as exporting the entire job to an ATL file which you can also download. I created the job in Data Services XI 3.2, so the ATL file won’t be compatible with versions prior to that.
I hope that someone finds the above of value if they are learning Data Services for the first time. In the next post I will cover the rest of the presentation, so stay tuned!



