Implementation of the handwriting synthesis experiments in the paper Generating Sequences with Recurrent Neural Networks by Alex Graves. The implementation closely follows the original paper, with a few slight deviations, and the generated samples are of similar quality to those presented in the paper.
Web demo is available here.
Use python 3.6.8 and ensure packages from requirements.txt
are installed. You can launch the GUI with gui.py
, or with gui.exe
from the releases if on windows
or use the Hand
class from demo.py
as seen below
lines = [
"Now this is a story all about how",
"My life got flipped turned upside down",
"And I'd like to take a minute, just sit right there",
"I'll tell you how I became the prince of a town called Bel-Air",
]
biases = [.75 for i in lines]
styles = [9 for i in lines]
stroke_colors = ['red', 'green', 'black', 'blue']
stroke_widths = [1, 2, 1, 2]
hand = Hand()
hand.write(
filename='img/usage_demo.svg',
lines=lines,
biases=biases,
styles=styles,
stroke_colors=stroke_colors,
stroke_widths=stroke_widths
)
Currently, the Hand
class must be imported from demo.py
. If someone would like to package this project to make it more usable, please contribute.
A pretrained model is included, but if you'd like to train your own, read these instructions.
Below are a few hundred samples from the model, including some samples demonstrating the effect of priming and biasing the model. Loosely speaking, biasing controls the neatness of the samples and priming controls the style of the samples. The code for these demonstrations can be found in demo.py
.
The following samples were generated with a fixed style and fixed bias.
Smash Mouth – All Star (lyrics)
The following samples were generated with varying style and fixed bias. Each verse is generated in a different style.
Vanessa Carlton – A Thousand Miles (lyrics)
The following samples were generated with a fixed style and varying bias. Each verse has a lower bias than the previous, with the last verse being unbiased.
Leonard Cohen – Hallelujah (lyrics)
This project was intended to serve as a reference implementation for a research paper, but since the results are of decent quality, it may be worthwile to make the project more broadly usable. I plan to continue focusing on the machine learning side of things. That said, I'd welcome contributors who can:
- Package this, and otherwise make it look more like a usable software project and less like research code.
- Add support for more sophisticated drawing, animations, or anything else in this direction. Currently, the project only creates some simple svg files.