February 29 - March 2, 2012
Montreal, Canada

Oracle In-database Hadoop: When MapReduce Meets RDBMS

The MapReduce programming model lets developers without experience with parallel and distributed
systems utilize the resources of a large, multi-CPU system. The Oracle RDBMS has had support for the MapReduce paradigm for years through SQL analytics, user defined pipelined table functions and aggregation objects. The Apache Hadoop implements the MapReduce model.

In this session, we describe a prototype of Oracle in-database Hadoop implementation that lets you
write and execute Hadoop compatible applications written in Java directly in the database.
The major advantages of our implementation include:
(1) source compatibility with Hadoop,
(2) minimal dependency on the Apache Hadoop infrastructure,
(3) seamless integration of MapReduce functionality in Oracle SQL
(4) better parallelism and efficiency due to data pipelining (i.e., table functions) and no intermediate materialization.

View all 161 sessions

Kuassi Mensah

Oracle Corporation

Kuassi is Director of Product Management at Oracle.
Products under his watch include Database Access frameworks (Net Services, DRCP), Database APIs (JDBC, PHP, Ruby, Python, Perl) and Java in the database.
Mr Mensah holds a MS and post graduate in Computer Sciences from the Programming Institute of University of Paris VI. He is a frequent speaker at IT events and has published several articles and a book http://www.amazon.com/exec/obidos/ASIN/1555583296
Blog http://db360.blogspot.com

Read More

Comments

Please remain courteous and constructive. Comments will be moderated.

Presented by

Sponsored by

Media