Figure 10: Networks trained on boards of width 4 and applied multiple times across a board of width 10 performed substantially worse (in terms of rows solved) than a network trained on boards of width 10.