Home
Derde generatie Business Intelligence
19 Nov 2007

Mark Whitehorn beschrijft in een artikel de toepassing van 'derde generatie' BI. Daar waar het in de 1e generatie slechts ging om het begrijpen van het verleden, ging het in de 2e generatie om het analyseren en begrijpen waarom bepaalde zaken gebeuren (om vervolgens verbeteringen door te voeren). De derde generatie gaat nog verder en business intelligence brengt hier de informatie direct bij de medewerker die in contact staat met de klant. Dit vraagt om BI voor een grote groep medewerkers ('BI for the masses') en een zeer snelle verwerking van informatie-aanvragen (realtime). In het artikel gaat hij ook in op de technologiën die deze ontwikkeling mogelijk hebben gemaakt.

 

Uit het artikel:

[...]

Why BI had to evolve

The traditional face of BI needs to change because the way in which we are using BI is changing. For a start, analysis is getting more complex, according to Ellen Rubin, vice president of marketing with data warehouse appliance vendor Netezza.

"Until recently, [our customers] have been looking for fraud after it has occurred," Rubin said. "Now we can look much further into the future, asking not just 'What patterns can we see in the data?' but 'What patterns are we likely to see in the future?' " 

But it is also about adding a very different type of usage, according to IBM's Andrews.

"We would characterize BI as having three generations," he said. "The first generation was about understanding the past. The second was about analyzing why things happened and making recommendations about the future. That's better than first, but I still liken this to driving a car by looking in the rear view mirror. The new, third generation is about making information available to the people in front of the customer."

This is a truly significant shift in the way enterprises use data warehouses. First- and second-generation systems needed to support a limited number of people who ran large, complex analytical queries. The third generation must support not only more complex queries from the same analysts but also a new workload that consists of thousands of users running very different queries. These may well be complex, but each is likely to hit a relatively small set of data within the warehouse.

Combining these very different workloads is non-trivial. So is it worth the effort? The Hudson's Bay Company, based in Toronto, certainly thinks so.

Hudson's Bay Company embraces third-generation BI

Sadly, not all shoppers are honest. The National Retail Federation estimates that retailers lose about $16 billion a year to returns fraud -- dishonest customers presenting stolen items for refund or using a sales receipt multiple times. Detecting the patterns of fraudulent returns after they have occurred is second-generation territory. But catching the offender at the checkout with the receipt in his hand illustrates very clearly the difference that a third-generation system can make.

By combining a data warehouse from Dayton, Ohio-based Teradata with IBM's WebSphere as middleware and BI software from McLean, Va.-based MicroStrategy Inc., the Hudson's Bay Company was able to update the data warehouse with sale, return, exchange and void data almost instantaneously. Essentially, it is now impossible for a receipt to be reused or for merchandise to be returned fraudulently.

These third-generation systems are not pie in the sky. The Hudson's Bay Company rolled out this system across all its stores and, within five months, the savings had delivered a 100% ROI, according to Mary-Jane Jarvis-Haig, senior manager of business intelligence development and support for Hudson's Bay.

"The cost savings have been huge," Jarvis-Haig said. "We're already exceeding our targeted benefits."

[...]

OK, so far it seems that third-generation business intelligence (BI) and data warehouse systems are the future. But third-generation BI requires our systems to run both huge analytical queries and transaction-like queries -- such as those from customer-facing employees -- with equal aplomb.

This is a problem we have seen before. For the last decade or so, we solved this problem by separating the analytical and transaction systems, because combining them was too difficult. Very early on in the history of BI, we learned that analytical and transactional queries made very different demands on the query engines. So great were the differences that we made copies of the transactional data, moved it into the data warehouse and restructured it there in order to allow the analytical queries to run more effectively. Now, we are expecting our third-generation BI systems to cope with analytical and transaction-like queries in the same system.

How has business intelligence and data warehouse software changed in the interim?

Such third-generation systems have learned from the data warehouse appliances, so they can and do use massively parallel processing (MPP) and in-memory querying. But query workload management is also important. This technology carefully distributes the resources available to the system across the individual queries -- ensuring, for example, that time-critical queries, such as those from customer-facing employees, are allocated enough resources to allow them to complete in a timely fashion. Vendors such as Hewlett-Packard believe this is very important, according to Greg Battas, distinguished technologist, BI group with HP.

"HP sees workload management as absolutely vital," Battas said. "To put that into some kind of perspective, about 25% of our database development people are currently working on workload management."

The market is also seeing new hybrid systems emerging from vendors like IBM and HP with characteristics of both the traditional data warehouse and the data warehouse appliance.

The three core technologies driving this change are:

  • MPP on commodity hardware
  • In-memory querying
  • Query workload management

These hybrid systems service not only the needs of analysts but also of a whole new layer of BI users within the company, which is remarkable. BI is finally moving from elite to egalitarian and perhaps, as analysts have long predicted, becoming pervasive throughout the enterprise.

The front end of business intelligence software -- and the audience -- changes

As third-generation systems move BI out to a much wider range of employees, it must become integrated into the software stack that those employees use on a daily basis. In some cases, this means rewriting custom software, but for many it means integrating BI into Microsoft's Office applications.

To this end, Microsoft has put massive efforts into BI, integrating Office 2007 with its own back-end BI tools. Analysis Services cubes can appear in Excel, you can data mine from within Excel or even (the mind does boggle a little here) from within Visio. However, Microsoft is simply one BI vendor; there are plenty of others out there. Somewhat surprisingly, Microsoft and other vendors, such as Teradata, are working together.

This cooperation not only allows Office components to reach data stored in Teradata, it also allows Microsoft's BI tools to do the same. Given the diversity of BI vendors, this is not a phenomenon that's going to disappear and, in this case, it is Software-as-a Service (SaaS) that is the key enabling technology. Service-oriented architecture (SOA) has been an essential technology in facilitating the cooperation, according to Ed White, director of product marketing with Teradata.

Bar charts, pie charts, yawn charts: Data visualization is changing

Pie charts date back to about 1800 and, useful as they are, we can do better, as the work of respected data visualization researchers Edward Tufte and William Cleveland shows. Several companies -- including Spotfire (now part of Tibco), QlikTech, Thinkmap, Tableau and others -- have been looking at this work and producing truly original ways of displaying complex data. I believe that this will have a profound influence on BI over the coming years, and others -- such as Roger Oberg, vice president, Spotfire product strategy with Palo Alto, Calif.-based Tibco Software Inc. -- agree with me.

"New trends like in-memory processing, 'free dimensional' ad hoc queries, and user definable workflows are democratizing BI," Oberg said. "We are moving from a world in which we push data that is often ignored to a world in which interaction massively increases the data's usefulness and therefore the number of people who want to use it."

As he points out, none of that will be effective unless people can visualize the data easily.

Yes, but what does it all mean for my business intelligence and data warehouse software?

Data is just numbers and text. One important lesson we have learned from BI is that keeping track of the meaning of the data is far, far more complex than we originally thought. In one sense, this isn't a technical problem, it's a human one -- since only humans can decide questions of meaning. However, some companies have been actively trying to address not only how we track meaning but how we track it over time, according to Cliff Longman, chief technology officer with Burlington, Mass.-based Kalido Inc.

"Fluidity of meaning is the problem that Kalido addresses," Longman said. "We find that users can get results if they are allowed to deal with a higher level of abstraction – higher than the logical model. Kalido makes data reusable even if the meaning changes over time."

And finally, though technical approaches differ, everyone I spoke to agreed on two important, related points:

1. Data volumes are growing, year on year. Two years ago, data warehouse vendor Kognitio was looking to scale its systems down to 200 GB for some customers, according to Roger Gaskell, product development director. Now, almost every proof-of-concept Kognitio does is above 5 TB, with most in the 50 to 250 TB range.

2. BI is no longer the preserve of the large enterprise; it has moved to the small/medium-sized enterprise. Ten years ago, Microsoft's vision was "BI for the masses," according to Amir Netz, product unit manager for Analysis Services -- and in recent years, other experts have often espoused the benefits of BI for small and medium-sized businesses.

 

Bron: www.searchdatamanagement.com

 

 

 

Corso Vacatures