Category Posts Navigation

Char-RNN

Posted by Marcus Zillman

Char-RNN
https://github.com/karpathy/char-rnn

This code implements multi-layer Recurrent Neural Network (RNN, LSTM, and GRU) for training/sampling from character-level language models. In other words the model takes one text file as input and trains a Recurrent Neural Network that learns to predict the next character in a sequence. The RNN can then be used to generate text character by character that will look like the original training data. The context of this code base is described in detail in my blog post. If you are new to Torch/Lua/Neural Nets, it might be helpful to know that this code is really just a slightly more fancy version of this 100-line gist that I wrote in Python/numpy. The code in this repo additionally: allows for multiple layers, uses an LSTM instead of a vanilla RNN, has more supporting code for model checkpointing, and is of course much more efficient since it uses mini-batches and can run on a GPU. This will be added to Artificial Intelligence Resources Subject Tracer™.

Leave a Reply

Facebook Comments

Sign up for Awareness Watch Newsletter

* = required field

Browse Categories