My field of research is in real-time data warehouseing. I am currently working on the READY project which goal is to explore trade-offs between satisfying multiple users data requirements in a cost effective way but with a reasonable time to market. My interests are around distributed systems, stream processing and databases.
I also contribute to different open source projects, specifically in the Apache Software Foundation.
Apache Gora: It is an in-memory processing framework which abstract different NoSQL data model into a simple key-value one.
Apache Samza: It is the streaming transformation platform using Apache Kafka as its main transportation layer.
Apache Nutch: It is a distributed web crawler leveraring Hadoop for spawning tasks. It mainly does wide crawls but it is being enhanced to provide more vertical crawls.
Apache Giraph: It is an iterative graph processing system built for high scalability.
Renato Javier Marroquín Mogrovejo, José Maria Monteiro, Javam C. Machado, Carlos Juliano M. Viana, Sérgio Lifschitz
Experimental statistical analysis of MapReduce Joins
SBBD 2013, Recife, Pernambuco, Brazil, 2013, [Paper]