In this paper, we consider the scene parsing problem. We propose a novel \textbf{M}ulti-\textbf{P}ath \textbf{F}eedback recurrent neural network (MPF-RNN) that enhances the capability of RNNs to model long-range context information at multiple levels and to better distinguish pixels that are easily confused in pixel-wise classification. In contrast to CNNs without feedback and RNNs with only a single feedback path, MPF-RNN propagates the contextual features learned at the top layers through weighted recurrent connections to \emph{multiple} bottom layers, helping them learn better features with such ``hindsight''. In addition, we propose a new training strategy that considers the loss accumulated over multiple recurrent steps, which improves the performance of MPF-RNN on parsing small objects and stabilizes the training procedure. We empirically demonstrate that an architecture with multiple feedback paths can effectively enhance the capability of deep neural networks to classify pixels that are hard to distinguish without higher-level context information. With these two novel components, MPF-RNN achieves new state-of-the-art results on four challenging scene parsing benchmarks: SiftFlow, Barcelona, CamVid, and Stanford Background.
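
To make the two components concrete, the following is a minimal sketch, not the implementation used in the paper. It assumes a toy network that operates on a single feature vector (standing in for one pixel's features) with fully connected layers instead of convolutions; the names `mpf_forward`, `accumulated_loss`, and `rand_matrix`, the additive way feedback is combined with the feed-forward input, and the unweighted sum over steps are all illustrative assumptions.

```python
import math
import random


def matvec(W, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def relu(v):
    return [max(0.0, a) for a in v]


def softmax(v):
    m = max(v)  # subtract the max for numerical stability
    e = [math.exp(a - m) for a in v]
    s = sum(e)
    return [a / s for a in e]


def rand_matrix(rows, cols, rng, scale=0.5):
    """Hypothetical helper: small random weights for the toy example."""
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


def mpf_forward(x, layers, feedback, classifier, steps):
    """Unroll a toy multi-path feedback network for `steps` recurrent steps.

    layers:     weight matrices of the feed-forward stack (all dim x dim)
    feedback:   dict {layer_index: weight matrix}; each entry is one
                weighted recurrent path from the top layer to that layer
    classifier: weight matrix mapping top-layer features to class scores
    Returns one class-probability vector per recurrent step.
    """
    top = [0.0] * len(x)  # no top-layer context is available at the first step
    outputs = []
    for _ in range(steps):
        h = x
        for i, W in enumerate(layers):
            pre = matvec(W, h)
            if i in feedback:
                # Assumption: feedback from the previous step's top layer
                # is added to the feed-forward pre-activation.
                fb = matvec(feedback[i], top)
                pre = [a + b for a, b in zip(pre, fb)]
            h = relu(pre)
        top = h
        outputs.append(softmax(matvec(classifier, top)))
    return outputs


def accumulated_loss(outputs, label):
    """Cross-entropy summed over all recurrent steps (assumed unweighted)."""
    return sum(-math.log(p[label]) for p in outputs)
```

Calling `mpf_forward` with `feedback={0: ..., 1: ...}` gives two feedback paths into the two lowest layers, while an empty `feedback` dict reduces the model to a plain feed-forward network whose output is identical at every step. Summing the loss over steps means that every unrolled prediction, not only the last one, contributes to the training signal.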

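This work proposes a new MPF-RNN model that, through multi-layer feedback and multiple weighted recurrent connections, strengthens the ability of RNNs to model long-range context information and to distinguish easily confused pixels. It also proposes a loss accumulation strategy that takes multiple feedback steps into account, yielding better scene segmentation performance than conventional models.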

Multi-Path Feedback Recurrent Neural Network for Scene Parsing