Seminar of the Division of Mathematical Logic (Zakład Logiki Matematycznej)
2008-10-10, 14:15, room 5820
The Division of Mathematical Logic invites you to its seminar, held on
Fridays at 14:15 (MIM UW, room 5820, ul. Banacha 2, Warsaw).
Current and archival information about the seminars has been updated at:
http://www.mimuw.edu.pl/~jakubw/seminarium.html
This coming Friday, 10 October 2008, at 14:15 (MIM UW, room 5820), the
following talk is scheduled:
"Open Source Edition of Infobright's Data Warehouse"
Dominik Ślęzak
ABSTRACT:
The theory of rough sets provides a powerful model for representation of
patterns and dependencies, applicable both in databases and data mining.
Although rough sets have numerous applications in data mining and
knowledge discovery, their use inside database engines is still largely
uncharted territory. This situation is not exceptional, however: even
the best-known paradigms of machine learning, soft computing,
artificial intelligence, and approximate reasoning are still waiting for
more recognition in database research.
Rough set-based algorithms and similar techniques can be applied to
improve database performance in several ways. We focus on the idea of
using available information to calculate rough approximations of data
needed to resolve queries and to assist the database engine in accessing
relevant data. We partition data into rough rows, each consisting of 64K
original rows. We automatically label rough rows with compact
information about their values on data columns, often involving
multi-column and multi-table relationships. One may say that we create
new information systems where objects correspond to rough rows and
attributes correspond to various flavors of rough information.
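
The labeling idea above can be illustrated with a minimal sketch in
Python. It is not Infobright's implementation: it assumes that the only
rough information kept per rough row is the minimum and maximum of one
integer column, that "64K" means 65,536 rows, and that the query is a
simple count of values above a threshold. The names RoughRow,
build_rough_rows, classify and count_greater are hypothetical. Rough
rows whose labels prove that all values match play the role of a lower
approximation, those that might match complete the upper approximation,
and only the uncertain ones require access to the original rows.

```python
from dataclasses import dataclass
from typing import List

# Assumption: "64K" in the abstract is read as 65,536 original rows.
ROUGH_ROW_SIZE = 64 * 1024


@dataclass
class RoughRow:
    start: int      # index of the first original row in this rough row
    end: int        # one past the index of the last original row
    min_value: int  # compact label: smallest value in the column chunk
    max_value: int  # compact label: largest value in the column chunk


def build_rough_rows(column: List[int], size: int = ROUGH_ROW_SIZE) -> List[RoughRow]:
    """Partition a column into rough rows and label each with min/max."""
    rough_rows = []
    for start in range(0, len(column), size):
        chunk = column[start:start + size]
        rough_rows.append(RoughRow(start, start + len(chunk), min(chunk), max(chunk)))
    return rough_rows


def classify(rough_row: RoughRow, threshold: int) -> str:
    """Classify a rough row for the predicate `value > threshold`."""
    if rough_row.min_value > threshold:
        return "relevant"    # every original row satisfies the predicate
    if rough_row.max_value <= threshold:
        return "irrelevant"  # no original row satisfies the predicate
    return "suspect"         # labels are inconclusive; data must be read


def count_greater(column: List[int], rough_rows: List[RoughRow], threshold: int) -> int:
    """Count values above threshold, reading data only for suspect rough rows."""
    total = 0
    for rough_row in rough_rows:
        kind = classify(rough_row, threshold)
        if kind == "relevant":
            # Answered from the label alone, without touching the data.
            total += rough_row.end - rough_row.start
        elif kind == "suspect":
            chunk = column[rough_row.start:rough_row.end]
            total += sum(1 for value in chunk if value > threshold)
    return total
```

In this sketch, a query touches the original data only for suspect
rough rows; the remaining rough rows are resolved from their labels.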
In this talk, we show how the above ideas guided us toward implementing
a fully functional data warehouse product, with interfaces provided
via integration with MySQL and internals based on the newest database
trends. Thanks to compact, flexible rough information, we became
especially competitive in the field of analytical data warehouses, where
users want to query terabytes of data in a complex, dynamically changing
way. Recently, we announced at www.infobright.org the open source
edition of our data warehouse, free to use and open to further
extension. In the talk, we illustrate the best scenarios for applying
our software to various aspects of data processing. We also discuss the
most promising directions for further improvement of our technology,
with special attention to the ideas based on the theory of rough sets
and corresponding techniques.