This script generates random names or words from a list of real names or words. The source data comes from various publicly available sources (US census data, dictionary files, forum user lists, etc.) and is fed into a script that computes the probabilities of small chunks of text. These probabilities are then used to randomly generate a statistically likely word. Finally, each generated word is checked against the source data and discarded if it already appears there, since such a word is a real entry rather than a newly generated one.
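
The approach described above can be sketched roughly as follows. This is only an illustration, not the script itself: it assumes the "small chunks of text" are fixed-length runs of characters (a character-level Markov chain of order 2), and the names `build_model`, `generate_word`, and `generate_new_words` are hypothetical.

```python
import random
from collections import defaultdict

# Assumed marker characters for word start and end; they must not occur in the data.
START, END = "^", "$"


def build_model(words, order=2):
    """Count which character follows each `order`-character chunk (order >= 1)."""
    model = defaultdict(lambda: defaultdict(int))
    for word in words:
        padded = START * order + word.lower() + END
        for i in range(len(padded) - order):
            chunk = padded[i:i + order]
            model[chunk][padded[i + order]] += 1
    return model


def generate_word(model, order=2, rng=None, max_len=30):
    """Build one word by sampling each next character in proportion to its count."""
    rng = rng or random
    chunk = START * order
    letters = []
    while len(letters) < max_len:
        followers = model.get(chunk)
        if not followers:
            break  # chunk never seen in the source data (e.g. empty model)
        nxt = rng.choices(list(followers), weights=list(followers.values()))[0]
        if nxt == END:
            break
        letters.append(nxt)
        chunk = (chunk + nxt)[-order:]
    return "".join(letters)


def generate_new_words(words, count, order=2, seed=None, max_attempts=10000):
    """Generate up to `count` distinct words that do not appear in the source list."""
    known = {w.lower() for w in words}
    model = build_model(words, order)
    rng = random.Random(seed)
    results = []
    for _ in range(max_attempts):
        if len(results) >= count:
            break
        candidate = generate_word(model, order, rng)
        # Discard empty words, words copied from the source, and repeats.
        if candidate and candidate not in known and candidate not in results:
            results.append(candidate)
    return results
```

Calling `generate_new_words(["anna", "anne", "hannah", "joanna"], count=5)` would return up to five name-like strings that are not in the input list. A larger `order` tends to produce words closer to the source data, at the cost of more candidates being discarded as duplicates of real entries.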