I would like to create sample data where each record pulls its status code from a dataset, and the generated records replicate a given distribution of status codes: say 70% “200” codes, 5% “404” codes, and so on…
Would it be easiest to simply create multiple schemas and merge them, or is there a better way?
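
To make the goal concrete, here is a minimal sketch in plain Python of the kind of output I'm after. It is not tied to any particular data generator. Only the 70% and 5% figures are real; the other codes and weights, and the names `STATUS_WEIGHTS` and `sample_status_codes`, are made up for illustration.

```python
import random
from collections import Counter

# Only "200" (70%) and "404" (5%) come from the question above.
# "301" and "500" and their weights are hypothetical placeholders
# chosen so that the weights add up to 100.
STATUS_WEIGHTS = {"200": 70, "404": 5, "301": 15, "500": 10}


def sample_status_codes(n, weights=STATUS_WEIGHTS, seed=None):
    """Draw n status codes with probability proportional to their weights."""
    rng = random.Random(seed)  # seeded generator so runs are repeatable
    codes = list(weights)
    return rng.choices(codes, weights=[weights[c] for c in codes], k=n)


# Build sample records, each carrying one weighted-random status code.
records = [
    {"id": i, "status": status}
    for i, status in enumerate(sample_status_codes(10_000, seed=42))
]

# Check the empirical distribution against the target weights.
counts = Counter(r["status"] for r in records)
for code, weight in STATUS_WEIGHTS.items():
    print(f"{code}: target {weight}%, got {100 * counts[code] / len(records):.1f}%")
```

A single weighted draw per record like this would avoid building one schema per status code and merging them afterwards, which is what I'm hoping a generator can do natively.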