We propose a graph-based representation learning framework for video
summarization. First, we convert an input video to a graph in which each node
corresponds to a video frame. Then, we impose sparsity on the graph
by connecting only those pairs of nodes that are within a specified