A few Recommendations for a Data Scientist who wants to get started in Recommender Systems

Juan Arévalo

d&a blog, data processing, RecSys

As a Data Scientist, you are expected to build all sorts of data products. Some involve simple yet highly valuable business trends extracted through data querying and cleansing; others call for more sophisticated Machine Learning algorithms for prediction, classification, or even recommendation. However, the cold start in a specific topic may be tough for Data Scientists, especially for …

Self-Service Performance Tuning for Hive

Angel Puerto

d&a blog, data processing

Hive is a very powerful data warehouse framework based on Apache Hadoop. Together, the two provide stable storage and processing capabilities for big data analysis. In this article, we will analyze how to monitor metrics and how to tune and optimize the workflow in this environment with Dr. Elephant. Hive is designed to enable easy data summarization, ad-hoc queries, and big data analysis. …