newsletter.etymo

11th January - 24th January 2019

1361 new papers

In this newsletter from Etymo, you can find the latest developments in machine learning research, including the most popular datasets in use, the most frequently appearing keywords, the important research papers associated with those keywords, and the most trending papers of the past two weeks.

If you and your friends like this newsletter, you can subscribe to our fortnightly newsletters here.

Fortnight Summary

The number of papers published in the past two weeks is back to 1361, the level we observed for most of 2018. Computer vision (CV) is still a main research area, as reflected in the popularity of the CV datasets and the most trending papers.

We present the emerging research interests under the "Trending Phrases" section. The papers in this section show some cutting-edge results. There are two good papers related to Attribute Recognition, the automatic description of people's appearance. There is also a new and more consistent algorithm for learning high-quality Node Representations in social and information networks. A very good summary of techniques to tackle False Information is also included. Please read the "Trending Phrases" section for more details.

Other notable developments in research include the following:

  • A new Neural Architecture Search to include network level structure search for better semantic image segmentation: Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentation
  • A new Transformer network to learn language dependency beyond a fixed-length context: Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
  • An investigation on fundamental requirements for the application of classifier patching on neural networks, an approach for adapting neural network models to handle concept drift in nonstationary environments: Towards Neural Network Patching: Evaluating Engagement-Layers and Patch-Architectures
  • An answer to a cross-network node classification problem by leveraging the abundant labeled information from a source network to help classify unlabeled nodes in a target network: Network Embedding for Cross-network Node Classification

  • Some notable review papers include:
  • Deep Learning for Anomaly Detection: A Survey
  • Taking Human out of Learning Applications: A Survey on Automated Machine Learning
  • Evolving the pulmonary nodules diagnosis from classical approaches to deep learning aided decision support: three decades development course and future prospect

  • And some discussions on the direction of machine learning in general include:
  • A framework to increase the transparency of machine learning technology: Model Cards for Model Reporting
  • A new principle for exploring context in machine learning models: Context Aware Machine Learning
  • A framework to extract reproducible knowledge from data that can guide scientific hypothesis generation and experimental design: Three principles of data science: predictability, computability, and stability (PCS)
  • Discussion about the impact of data science on politics: Data Science as Political Action: Grounding Data Science in a Politics of Justice
Popular Datasets

Computer vision is still the main focus area of research.

Name         Type                                 Number of Papers
MNIST        Handwritten Digits                   77
ImageNet     Image Dataset                        49
CIFAR-10     Tiny Image Dataset in 10 Classes     32
COCO         Common Objects in Context            20
Cityscapes   Images from 50 different cities      13
KITTI        Autonomous Driving                   12
CelebA       Large-scale CelebFaces Attributes    11

Trending Phrases

In this section, we present a list of phrases that appeared significantly more often in this newsletter than in previous newsletters.
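
As a rough illustration of what "appeared significantly more often" can mean in practice, the sketch below ranks phrases by a smoothed ratio of their frequency in the current fortnight's papers to their frequency in the previous fortnight's. The counting scheme, smoothing constant and cut-off are our own illustrative assumptions, not Etymo's actual method.

    from collections import Counter

    def trending_phrases(current_docs, previous_docs, min_count=5, smoothing=1.0):
        """Rank phrases that appear more often in the current fortnight's
        texts than in the previous fortnight's (illustrative sketch only)."""
        def phrase_counts(docs):
            counts = Counter()
            for text in docs:
                tokens = text.lower().split()
                counts.update(tokens)                                        # unigrams
                counts.update(" ".join(p) for p in zip(tokens, tokens[1:]))  # bigrams
            return counts

        current = phrase_counts(current_docs)
        previous = phrase_counts(previous_docs)
        scores = {}
        for phrase, count in current.items():
            if count < min_count:
                continue  # ignore phrases that are still rare overall
            # Smoothed frequency ratio: how much more common is the phrase now?
            scores[phrase] = (count + smoothing) / (previous.get(phrase, 0) + smoothing)
        return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)

    # Usage: trending_phrases(new_abstracts, old_abstracts)[:10]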

Etymo Trending

Presented below is a list of the most trending papers added in the last two weeks.

  • Deep Learning for Anomaly Detection: A Survey:
    This 50-page paper groups state-of-the-art research techniques into categories based on their underlying assumptions and the approach adopted. Within each category, the authors outline the basic anomaly detection technique and its variants, present the key assumptions used to differentiate between normal and anomalous behavior, and discuss the advantages and limitations of each technique. (A minimal sketch of one such technique appears after this list.)

  • Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentation:
    The authors study Neural Architecture Search (NAS) for semantic image segmentation, an important computer vision task that assigns a semantic label to every pixel in an image. They propose to search the network-level structure in addition to the cell-level structure, which forms a hierarchical architecture search space. They present a network-level search space that includes many popular designs, and develop a formulation that allows efficient gradient-based architecture search (3 P100 GPU days on Cityscapes images); a simplified sketch of that relaxation appears after this list. They demonstrate the effectiveness of the proposed method on the challenging Cityscapes, PASCAL VOC 2012, and ADE20K datasets.

  • Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context:
    This paper proposes a novel neural architecture, Transformer-XL, that enables the Transformer to learn dependency beyond a fixed length without disrupting temporal coherence. It consists of a segment-level recurrence mechanism and a novel positional encoding scheme; the recurrence mechanism is sketched after this list. This method not only enables capturing longer-term dependency, but also resolves the problem of context fragmentation. The code, pretrained models, and hyperparameters are available in both TensorFlow and PyTorch.
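
To make the anomaly detection survey's framing concrete, here is a minimal sketch of one widely used family of techniques it covers: reconstruction-based detection, where an autoencoder is trained on normal data and samples with a high reconstruction error are flagged as anomalous. The architecture, training loop and thresholding rule below are our own illustrative assumptions, not code from the survey.

    import torch
    import torch.nn as nn

    class AutoEncoder(nn.Module):
        """Small fully connected autoencoder, trained on 'normal' data only."""
        def __init__(self, dim, hidden=16):
            super().__init__()
            self.encoder = nn.Sequential(nn.Linear(dim, hidden), nn.ReLU())
            self.decoder = nn.Linear(hidden, dim)

        def forward(self, x):
            return self.decoder(self.encoder(x))

    def fit(model, normal_data, epochs=200, lr=1e-3):
        opt = torch.optim.Adam(model.parameters(), lr=lr)
        for _ in range(epochs):
            opt.zero_grad()
            loss = ((model(normal_data) - normal_data) ** 2).mean()
            loss.backward()
            opt.step()

    def anomaly_scores(model, x):
        # Per-sample reconstruction error: a large error suggests the sample
        # does not look like the normal data the model was trained on.
        with torch.no_grad():
            return ((model(x) - x) ** 2).mean(dim=1)

A common choice is to flag samples whose score exceeds a high percentile (say the 95th) of the training-set scores; the survey discusses the assumptions and failure modes behind this kind of approach in much more detail.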
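
The "efficient gradient-based architecture search" used by Auto-DeepLab relies on relaxing a discrete choice between candidate operations into a softmax-weighted mixture, so architecture weights can be optimised by gradient descent alongside the ordinary network weights. The sketch below shows that relaxation in isolation; the candidate operations and sizes are our own simplified assumptions and do not reproduce Auto-DeepLab's cell-level or network-level search space.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class MixedOp(nn.Module):
        """Continuous relaxation of a discrete operation choice: the output is a
        softmax-weighted sum of all candidate ops, so the architecture weights
        alpha receive gradients just like ordinary parameters."""
        def __init__(self, channels):
            super().__init__()
            self.ops = nn.ModuleList([
                nn.Conv2d(channels, channels, 3, padding=1),  # 3x3 convolution
                nn.Conv2d(channels, channels, 5, padding=2),  # 5x5 convolution
                nn.Identity(),                                # skip connection
            ])
            self.alpha = nn.Parameter(torch.zeros(len(self.ops)))  # architecture weights

        def forward(self, x):
            weights = F.softmax(self.alpha, dim=0)
            return sum(w * op(x) for w, op in zip(weights, self.ops))

After the search, the operation with the largest weight is kept and the others are discarded; Auto-DeepLab applies the same idea at the network level as well, searching over how the spatial resolution changes from layer to layer.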
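
The segment-level recurrence in Transformer-XL can be summarised as: when processing the current segment, the hidden states cached from the previous segment are prepended as extra attention context, with gradients stopped so that training still proceeds segment by segment. The sketch below shows only that caching step, using standard multi-head attention rather than the paper's relative positional encoding, and the layer sizes are illustrative assumptions.

    import torch
    import torch.nn as nn

    class RecurrentSegmentLayer(nn.Module):
        """One attention layer with segment-level recurrence: hidden states from
        the previous segment are cached (gradients stopped) and reused as extra
        attention context for the current segment."""
        def __init__(self, d_model=64, n_heads=4):
            super().__init__()
            self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
            self.memory = None  # cached hidden states from the previous segment

        def forward(self, x):                      # x: (batch, seg_len, d_model)
            if self.memory is None:
                context = x
            else:
                # Extend the attention context with the cached segment; detach()
                # below ensures no gradient flows back into the previous segment.
                context = torch.cat([self.memory, x], dim=1)
            out, _ = self.attn(x, context, context)
            self.memory = x.detach()               # cache for the next segment
            return out

Feeding consecutive segments through such a layer lets information propagate well beyond a single fixed-length segment, which is what resolves the context-fragmentation problem described above.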

Frequent Words

"Learning", "Model", "Data" and "Network" are the most frequent words. The top two papers associated with each of these keywords are:

Hope you have enjoyed this newsletter! If you have any comments or suggestions, please email ernest@etymo.io or steven@etymo.io.