Wednesday, September 18, 2013

Multi-Dimensional Modelling


Considering the various disadvantages of the relational model, E. F. Codd formulated 12 basic principles of multidimensional data presentation, as follows:-*1

1. Multi-Dimensional Conceptual View: A business analyst sees the company's world as multivariate and multi-dimensional. Accordingly, the conceptual data model in an OLAP product should be multi-dimensional in nature, allowing analysts to perform intuitive operations: "slice and dice", rotate, and pivot along the directions of consolidation.

2. Transparency: The user should not need to know what concrete resources are used for data storage and processing, or how the data are organised. Whether or not the OLAP product is part of the user's own tools, that fact should be transparent to the user. If OLAP is provided through client-server computing, this too should, wherever possible, be imperceptible to the user. OLAP should be provided within an open architecture, allowing the user, wherever he is, to communicate with the server through the analytical tool. In addition, transparency should be maintained when the analytical tool interacts with homogeneous or heterogeneous databases.

3. Accessibility: The business analyst should be able to perform analysis within a common conceptual schema, while the data may remain under the control of older, legacy DBMSs and still be mapped to the common analytical model. The OLAP tool should therefore superimpose its own logical schema on the physical data stores, performing all conversions required to present a uniform, consistent and complete user view of the information.

4. Consistent Reporting Performance: As the number of dimensions and the database size increase, analysts should not experience any noticeable decrease in performance. Stable performance is necessary to maintain the ease of use required to bring OLAP to the end user. If the analyst experiences substantial differences in performance depending on the number of dimensions, he will try to compensate with a design strategy that presents the data in some other way, rather than in the way the data really needs to be presented. Spending time working around the system to compensate for its inadequacy is not what analytical products are intended for.

5. Client-Server Architecture: The large volumes of data needed for online analytical processing are mostly stored on mainframes but accessed from PCs. One requirement is therefore that OLAP products can operate in a client-server environment. The main idea is that the server component of the OLAP tool should be intelligent enough to build a common conceptual schema by generalising and consolidating the various logical and physical schemas of the corporate databases.

6. Generic Dimensionality: All dimensions should be equivalent. Additional operational capabilities may be granted to selected dimensions, but since dimensions are symmetric, any such capability may be granted to any dimension. The basic data structure, formulas and report formats should not be biased toward any one dimension. Each dimension should be handled in the same way regardless of its structure and operational capabilities.

7. Dynamic Sparse Matrix Handling: The OLAP tool should guarantee optimal processing of sparse matrices. Access speed should be maintained regardless of how the data cells are laid out, and should stay constant for models with different numbers of dimensions and different degrees of data sparsity.

8. Multi-User Support: Often several analysts need to work simultaneously with one analytical model, or to create different models based on the same data. The OLAP tool should give them concurrent access while guaranteeing data integrity and security.

9. Unrestricted Cross-dimensional Operations: Calculation and data manipulation across any number of dimensions should not prohibit or limit any relationship among data cells. Conversions that require arbitrary definition should be specified in a functionally complete formula language.

10. Intuitive Data Manipulation: Reorienting consolidation paths, drilling into detail across columns and rows, aggregation and other manipulations inherent to a hierarchical structure should be performed through the most convenient, natural and comfortable user interface possible.

11. Flexible Reporting: Various methods of data visualisation should be supported; in other words, reports should be presentable in any possible orientation.

12. Unlimited Dimensions and Aggregation Levels: It is strongly recommended that every serious OLAP tool support at least 15 (preferably more than 20) dimensions in an analytical model. Moreover, each of these dimensions should allow a practically unlimited number of user-defined aggregation levels along any direction of consolidation.


Basic OLAP functionality
1)      Selection of dimensions: slice and dice.
2)      Views from multiple perspectives: multiple views of the data, with pivoting and graphical formats.
3)      Filtering of data based on a selection.
4)      Drill-down to a lower level and roll-up to a higher level, i.e. ranging across all aggregation levels.
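
The operations listed above can be sketched in a few lines of Python. This is only an illustration on a made-up fact table, not how a real OLAP engine is implemented; the rows, the dimension names and the helper functions `select` and `aggregate` are all assumptions introduced for the example.

```python
from collections import defaultdict

# Hypothetical fact rows: (year, quarter, region, product, sales).
FACTS = [
    (2012, "Q1", "North", "Laptop", 100),
    (2012, "Q2", "North", "Phone", 150),
    (2012, "Q1", "South", "Laptop", 80),
    (2013, "Q1", "North", "Laptop", 120),
    (2013, "Q2", "South", "Phone", 200),
]
DIMENSIONS = ("year", "quarter", "region", "product")


def select(facts, **criteria):
    """Slice (one dimension fixed) or dice (several dimensions restricted)."""
    rows = []
    for row in facts:
        record = dict(zip(DIMENSIONS, row))
        if all(record[dim] in allowed for dim, allowed in criteria.items()):
            rows.append(row)
    return rows


def aggregate(facts, by):
    """Sum the sales measure, grouped by the given dimensions."""
    totals = defaultdict(int)
    for row in facts:
        record = dict(zip(DIMENSIONS, row))
        key = tuple(record[dim] for dim in by)
        totals[key] += row[-1]
    return dict(totals)


# Slice: fix a single member of one dimension.
north = select(FACTS, region={"North"})
# Dice: restrict several dimensions at once.
sub_cube = select(FACTS, year={2012}, product={"Laptop"})
# Roll-up to the year level, then drill down to year and quarter.
by_year = aggregate(FACTS, by=("year",))
by_year_quarter = aggregate(FACTS, by=("year", "quarter"))
```

Grouping by fewer dimensions is a roll-up and grouping by more is a drill-down; filtering before aggregating combines selection with either of them.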

Basic terms and definitions:-
1)      Cube: a multi-dimensional structure (it can have more than three dimensions) that stores pre-computed values of data (measures) to provide faster retrieval of results aggregated across various dimensions.
2)      Measure: a numeric value collected from facts that can be aggregated across multiple dimensions.
3)      Dimension: a broad group of descriptive data that acts like an index for a measure; it can represent a broad aspect of the business.
4)      Level: a step in the hierarchy within a dimension, which can be drilled down to get the details of the levels below.
5)      Scope: a way to create arbitrary, predefined subsets of a dimension at a grain level.
6)      Member: a data value of a business aspect, i.e. one value within a dimension.
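
To make these terms concrete, here is a small sketch of a cube that pre-computes a measure for every combination of dimension members. The facts are invented, the three dimensions are assumed to be year, city and product, and the `ALL` marker and `build_cube` helper are names made up for this illustration; real OLAP servers store and compress their aggregates very differently.

```python
from itertools import product

ALL = "*"  # marker meaning "aggregated over every member of this dimension"

# Hypothetical facts: three dimension members followed by one measure (units sold).
FACTS = [
    ("2013", "Melbourne", "Tea", 10),
    ("2013", "Melbourne", "Coffee", 25),
    ("2013", "Sydney", "Tea", 5),
    ("2012", "Sydney", "Coffee", 40),
]


def build_cube(facts):
    """Pre-compute the measure for every combination of members and ALL."""
    cube = {}
    for *members, value in facts:
        # Each fact contributes to 2^n cells: every dimension is either
        # kept at its member or rolled up to ALL.
        for cell in product(*[(m, ALL) for m in members]):
            cube[cell] = cube.get(cell, 0) + value
    return cube


cube = build_cube(FACTS)
print(cube[("2013", ALL, ALL)])    # units sold in 2013 -> 40
print(cube[(ALL, "Sydney", ALL)])  # units sold in Sydney -> 45
print(cube[(ALL, ALL, ALL)])       # grand total -> 80
```

Because every aggregate is computed once when the cube is built, answering a query is a single dictionary lookup. That is the trade-off a cube makes: more storage and load time in exchange for faster retrieval.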

References:-

Nature of Business Intelligence Application

Business users are always looking for ways to use the unstructured data in their systems more innovatively in order to make better decisions.

So, any business intelligence application should satisfy its users in the following ways:-
1.       It should be able to identify problems or opportunities faster, e.g. it is like looking at report cards and telling the good students from the bad ones.
2.       It should be able to relate matters in a meaningful way and present them through a proper communication medium such as graphs, bars, charts, etc.
3.       It should be able to provide quality information to the relevant stakeholders.
4.       It should have decision-support features such as forecasting, goal seeking and a scenario manager.
5.       It should be able to gather information, analyse it and put the processed information into reports in a structured manner. It should say where we are now, where we are heading and where we should go.


The main functions of Business Intelligence systems are listed below:-*1
1.       Access to up-to-date information
2.       Email relevant information
3.       Word processing
4.       Spreadsheet modelling
5.       Trend Analysis
6.       Graphical Information
7.       Drill-Down Facilities
8.       Tabular Information
9.       Exception Reporting
10.   Textual Information
11.   Calendars
12.   Access to external databases
13.   External News Access
14.   Automated filing
15.   Computer conferencing

Problems that users faced are listed below:-*2

1.       Users weren’t trained properly
2.       Lack of access to external data
3.       Failed to meet objectives and user requirements
4.       Wasn’t easy to use
5.       Lack of sponsorship
6.       Too long to develop

Mintzberg’s three informational roles of management:-
1)      Monitor: In this role, the manager seeks out information related to the business processes in the organisation and looks for relevant changes in the environment. The manager also continuously monitors the team in terms of productivity and well-being.
2)      Disseminator: In this role, the manager communicates potentially useful information to the team and other relevant people.
3)      Spokesperson: In this role, the manager speaks for the organisation, transmitting information about the organisation to people outside it.

Saturday, August 31, 2013

Introduction to BI and OLAP


What is Business Intelligence?


BI stands for Business Intelligence. There are many ways to define what BI is.
Business users do not need to know what the data is, where it is, how it arrives or where it goes; what they understand is reading REPORTS that make sense for the business.
So, BI can be defined as a Decision Support System (DSS) that helps in collating data, processing it and making it available in a sensible report format that business users can read, understand and use to take proper decisions.
BI is a fancy term used nowadays; it already existed in the 70s, 80s and 90s, when people used to call it Management Information System (MIS) and Executive Information System (EIS).
In most systems today, data is present but in an unstructured form that is difficult to comprehend or make sense of. BI helps by structuring the data into proper information that can be presented to different business users to run business processes, monitor the progress and status of those processes, and forecast things like growth rate, sales and consumption based on past data.
BI systems are built on Online Analytical Processing (OLAP) databases, which typically differ in many ways from normal Enterprise Resource Planning (ERP) or Customer Relationship Management (CRM) systems, which are built on Online Transaction Processing (OLTP) databases.

Major Difference between OLTP and OLAP


CATEGORY          | OLTP                                           | OLAP
DATA SOURCE       | Normal input or transactional data.            | Data for OLAP comes from the OLTP database.
DATA CONTENT      | Live database system with the latest updates.  | Multi-dimensional views of PAST data.
PROCESSING SPEED  | Fast processing.                               | Slower processing, because complex aggregation increases computation time.
DATA RETRIEVAL    | Data can be retrieved by simple select queries. | Data is retrieved with complex Multidimensional Expressions (MDX) queries.
DATABASE DESIGN   | Database is normalised to avoid redundancy.    | Focus is on performance, hence denormalised data.
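
The contrast in the comparison above can be tried out with a small sketch using Python's built-in sqlite3 module. Everything here is an assumption made for illustration: the table names, the sample rows, and the use of a plain SQL GROUP BY as a stand-in for an analytical query (SQLite does not support MDX).

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- OLTP side: normalised tables, one row per transaction.
    CREATE TABLE customer (id INTEGER PRIMARY KEY, name TEXT, city TEXT);
    CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER,
                         order_date TEXT, amount REAL);
    INSERT INTO customer VALUES (1, 'Asha', 'Melbourne'), (2, 'Ben', 'Sydney');
    INSERT INTO orders VALUES
        (1, 1, '2013-01-15', 100.0),
        (2, 1, '2013-02-10', 50.0),
        (3, 2, '2013-02-20', 75.0);

    -- OLAP side: a denormalised copy loaded from the OLTP tables.
    CREATE TABLE sales_fact AS
        SELECT c.city AS city, substr(o.order_date, 1, 7) AS month,
               o.amount AS amount
        FROM orders o JOIN customer c ON c.id = o.customer_id;
""")

# Typical OLTP access: fetch one current record by its key.
order = conn.execute("SELECT amount FROM orders WHERE id = ?", (2,)).fetchone()

# Typical analytical access: aggregate past data across dimensions.
summary = conn.execute(
    "SELECT city, month, SUM(amount) FROM sales_fact "
    "GROUP BY city, month ORDER BY city, month"
).fetchall()

print(order)
print(summary)
```

The denormalised sales_fact table repeats the city on every row, which costs storage but lets the analytical query run without a join.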


Characteristics of a BI application *1


1.       It should provide summary reports of the critical operational data of the business process.
2.       It should provide selected details of critical operations.
3.       It should have “DRILL DOWN”, i.e. flexible, intuitive navigation between various reporting points.
4.       It should have high-quality ‘VISUAL’ presentation, i.e. use of graphics, icons, colour coding and hotspots, so that it can be easily understood by business users.
5.       It should require minimal use of the keyboard; executives should be able to make sense of the reports, charts, graphs, etc.
6.       It should have access to external data sources.

Key Issues in BI system Development *2


1.       The sponsor may lose interest due to dissatisfaction with product development.
2.       Unclear role of IT.
3.       Complex system architecture, including the design of a multi-dimensional view of the data (OLAP).
4.       Requirement elicitation
a.       Comprehending unstructured information.
b.      Issues faced by developers, such as finding time to work with executives and dealing with changing and unsupportable requirements.





REFERENCE:

*1 Reference: Slide 31/32 of FIT5093 week 1.
*2 Reference: Slide 34/35 of FIT5093 week 1.

Nesting of Virtualisation by Hyper-V

Hyper-V allows virtualisation inside Windows Server 2008 R2.

Steps to enable Hyper-V in Windows Server 2008 R2

Kindly make the hardware settings as follows:-



To enable Hyper-V in the Windows Server 2008 R2 guest OS, follow these steps.
Add Roles: select Hyper-V.
If you get the message "Hyper-V cannot be installed", check whether virtualisation is enabled in your BIOS and check the settings given above carefully.


Then make a change in the virtual machine's configuration file (.vmx) by adding the parameter: hypervisor.cpuid.v0 = "FALSE"
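
If you prefer not to edit the file by hand, the change can be scripted. The sketch below is only a minimal illustration that works on the file's text: the helper name `add_vmx_setting` and the sample contents are made up, and it assumes each .vmx entry sits on its own line in the form name = "value". It is a good idea to power off the virtual machine and keep a backup copy of the .vmx file before changing it.

```python
def add_vmx_setting(vmx_text, key, value):
    """Return vmx_text with `key = "value"` set, replacing any existing entry."""
    new_line = f'{key} = "{value}"'
    lines = vmx_text.splitlines()
    for i, line in enumerate(lines):
        # Assumed entry format: name = "value"
        if line.split("=", 1)[0].strip() == key:
            lines[i] = new_line
            break
    else:
        # The key was not present, so append it at the end.
        lines.append(new_line)
    return "\n".join(lines) + "\n"


# Hypothetical .vmx contents, used only to demonstrate the function.
sample = 'displayName = "Win2008R2"\nmemsize = "4096"\n'
updated = add_vmx_setting(sample, "hypervisor.cpuid.v0", "FALSE")
print(updated)
```

To apply it to a real file, read the file's text, pass it through the function and write the result back to the same path. Running it twice is harmless because an existing entry is replaced rather than duplicated.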


Now it should work.


REFERENCE : http://www.veeam.com/blog/nesting-hyper-v-with-vmware-workstation-8-and-esxi-5.html

OS Virtualisation (Steps to Install Windows Server 2008 R2 in Windows 7)



First we need to understand why we need virtualisation.
Consider a simple situation: you have a Windows 7 operating system and you want to experience the look and feel of a Mac system. What are you going to do? In most cases people might think of buying a new machine with the Mac operating system installed. This becomes very expensive and carries an additional financial burden.
In such cases virtualisation software can be installed on the Windows 7 host operating system and Mac OS can be loaded as a guest OS.
This was a very simple situation, but in industry, where people work with cutting-edge technologies across different platforms, virtualisation is very useful.

What is Virtualisation?
In simple terms, it enables a single system to act in multiple forms although there is only one set of hardware.
This is done by adding a layer of abstraction between the host operating system and the guest operating system. This abstraction is provided by virtualisation software such as VirtualBox, VMware, etc.

Installing VMware
Download the virtualisation software (VMware Workstation) from its website.
Take special care to download the right build, x86 (32-bit) or x64 (64-bit); choose the one that matches the host OS. Click Run and install the software.

Installing an Environment (Guest OS) in the Host OS
The primary requirement is that the hardware supports virtualisation, which depends on the processor you are using.
Follow the link to check your processor: http://www.intel.com/support/processors/sb/cs-030729.htm
Once you have verified that the hardware supports it, enable virtualisation in the BIOS and proceed with the steps below. A screenshot from a Lenovo Z580 is attached below.

PIC-1: ENABLING VIRTUALISATION IN BIOS

After enabling it in the BIOS, follow the steps below to install Windows Server 2008 R2:

Step 1: Create a new Virtual Machine

Step 2: Choose TYPICAL and click NEXT.

Step 3: Choose the installer disc option and browse to its location.

Step 4: Choose the option you want and enter the Windows product key if you have one.

Step 5: Enter the name of the virtual machine and select where you would like to store it.

Step 6: Set the disk size and choose to store the virtual disk as a single file.


Step 7: Click Finish.


The operating system will start booting for the first time.