A large enough data set can break the Behat test generation, because Google Sheets limits the length of the text a formula can return ("Text result of JOIN is longer than the limit of 50000 characters."). This could be fixed by refactoring the test generation, perhaps by writing a custom function. It could also be sidestepped by using CSV files instead of Gherkin data tables.
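
One possible shape for such a custom function is sketched below in TypeScript. It is an illustration rather than the existing generator's code: the names `gherkinTableRows` and `escapeCell` are hypothetical, and it assumes the sheet can consume one Gherkin table row per output cell instead of a single joined string. Apps Script runs JavaScript, so the type annotations would need to be stripped or transpiled before use there.

```typescript
// Hypothetical helper: names and signature are illustrative assumptions,
// not part of the existing test generation.
const CELL_LIMIT = 50000; // character limit quoted in the Sheets error message

// Escape characters that are special inside a Gherkin table cell.
// Backslashes are handled first so later escapes are not doubled.
function escapeCell(value: unknown): string {
  return String(value)
    .replace(/\\/g, "\\\\")
    .replace(/\|/g, "\\|")
    .replace(/\n/g, "\\n");
}

// Turn a 2D range of values into one Gherkin table row per output row,
// so no single cell has to hold the whole joined table.
function gherkinTableRows(rows: unknown[][]): string[][] {
  return rows.map((row) => {
    const line = "| " + row.map(escapeCell).join(" | ") + " |";
    if (line.length > CELL_LIMIT) {
      throw new Error("A single table row exceeds the cell limit");
    }
    return [line];
  });
}
```

Because each table row lands in its own cell, the limit applies per row rather than to the whole table. Whatever assembles the final feature file would then need to read the column of rows instead of one cell.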