30th November - 14th December 2018

1881 new papers

This is our last newsletter for 2018. The Etymo team wish you a great holiday and see you in the new year!

In this newsletter from Etymo, you can find out the latest development in machine learning research, including the most popular datasets used, the most frequently appearing keywords, the important research papers associated with the keywords, and the most trending papers in the past two weeks.

You can subscribe to our fortnightly newsletters here.

Fortnight Summary

Computer vision (CV) is still a main research area, as reflected on the popularity of the CV datasets and the most trending papers.

One of the most popular paper in the last two weeks is the revised version of (GAN Dissection: Visualizing and Understanding Generative Adversarial Networks), which was breifly mentioned in our newsletter summary from last week. The introductory collection of deep reinforcement learning (An Introduction to Deep Reinforcement Learning) is also popular. The paper co-authored by Google Brain and researchers about a software module for uncertainty functions based on Bayseian neural networks (Bayesian Layers: A Module for Neural Network Uncertainty) also gains a significant amount of attention.

We also present the emerging interests in research under the "Trending Phrases" section. The papers in this section shows some cutting edge results. There are two good papers about rule learning: Learning Interpretable Rules for Multi-label Classification advocates a rule-based approach to multi-label classification, while That's Mine! Learning Ownership Relations and Norms for Robots presents a robotic system capable of representing, learning, and inferring ownership relations and norms. One of the applications of Graph-based Security and Privacy Analytics via Collective Classification with Joint Weight Learning and Propagation can also be used to detect fake reviews with a better accuracy. A Survey on Trust Modeling from a Bayesian Perspective gives a global comprehensive review on the variants of Bayesian trust models.

For other noticable development, there are new reviews and summaries of existing machine learning knowledge, such as Model Evaluation, Model Selection, and Algorithm Selection in Machine Learning, A Tale of Three Probabilistic Families: Discriminative, Descriptive and Generative Models, and Neural Approaches to Conversational AI. Progressive Sampling-Based Bayesian Optimization for Efficient and Automatic Machine Learning Model Selection shows progress in efficiently identifying high-quality solutions required by many machine learning-based clinical data analysis tasks. Hybrid Microaggregation for Privacy-Preserving Data Mining introduces a better anonymization technique in privacy-preserving data mining. MixTrain: Scalable Training of Verifiably Robust Neural Networks to defend against adversarial inputs into neural networks. Wireless Network Intelligence at the Edge talks about distributed, low-latency and reliable machine learning at the wireless network edges.

Popular Datasets

Computer vision is still the main focus area of research.

Name Type Number of Papers
MNIST Handwritten Digits 98
ImageNet Image Dataset 63
CIFAR-10 Tiny Image Dataset in 10 Classes 52
COCO Common Objects in Context 30
CelebA Large-scale CelebFaces Attributes 26
KITTI Autonomous Driving 21
Cityscapes Images from 50 different cities 18

Trending Phrases

In this section, we present a list of words/ phrases that appeared significantly more in this newsletter than the previous newsletters.

Etymo Trending

Presented below is a list of the most trending papers added in the last two weeks.

  • GAN Dissection: Visualizing and Understanding Generative Adversarial Networks:
    The authors present an analytic framework to visualize and understand GANs at the unit-, object-, and scene-level. They show several practical applications enabled by the framework, from comparing internal representations across different layers, models, and datasets, to improving GANs by locating and removing artifact-causing units, to interactively manipulating objects in a scene. The authors also provide open source interpretation tools to help researchers and practitioners.

  • An Introduction to Deep Reinforcement Learning:
    This 140-page muanuscript provides an introduction to deep reinforcement learning models, algorithms and techniques. Particular focus is on the aspects related to generalization and how deep RL can be used for practical applications.

  • Bayesian Layers: A Module for Neural Network Uncertainty:
    This paper is co-authored by Google Brain and researchers. It is motivated by research on priors and algorithms for Bayesian neural networks, scaling up Gaussian processes and expressive distributions via invertible functions. This paper describes Bayesian Layers, an extension of neural network libraries which contributes the idea to enable distributions over functions instead of only deterministic functions as “layers”.

Frequent Words

"Learning", "Model", "Data" and "Training" are the most frequent words. The top two papers associated with each of the key words are:

Hope you have enjoyed this newsletter! If you have any comments or suggestions, please email or