How?

jhp123 · on April 12, 2023

use a regex like (aardvark|apple|...|zebra)* and the standard DFA construction for regular expressions.

behdad · on April 12, 2023

You are technically right. Though if we assume that the size of the dictionary is at least o(n), then the size of such DFA will be exponential in n, and indexing it will be o(n), resulting in a o(n^2) solution again, I think.

jhp123 · on April 12, 2023

The number of states is exponential for a general regex. This regex has an NFA closely resembling the trie for the dictionary. There could be at most one live state per depth in the trie. So the number of possible states is bounded at least by (words in dictionary)^(length of longest word). Really much smaller because the states at different levels have to share prefixes.

ghusbands · on April 12, 2023

They are completely correct. If the DFA fits in RAM, following state transitions will be O(1) and using the DFA for a concatenated string of length N will be O(N). You simply missed a good solution.

bonzini · on April 12, 2023

Without the *, you get a graph that is a compressed version of the trie. With the *, the size of the DFA is extremely likely to explode.