Zugg Software :: View topic - IDEA:Optimized Stringlists and Database Variables in Triggers

Enchanter Joined: 05 Mar 2003 Posts: 593 Location: Canada

Inspired by this thread
I was thinking you could add an option to optimize stringlist and database variables in triggers, to reduce the amount of backtracking needed to match and, more importantly, the amount of backtracking needed to fail.

Using a Patricia trie like the one shown here

Using that example, you could take a string list containing:
romane
romanus
romulus
rubens
ruber
rubicon
rubicundus

Which would normally make a regex like
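To illustrate the trie idea, here is a rough sketch in Python, with its `re` module standing in for PCRE. It assumes a plain stringlist expands to a flat alternation of its items. The helper names `build_trie` and `trie_to_regex` are made up for this example and are not part of zMUD or CMUD.

```python
import re

def build_trie(words):
    """Build a nested-dict trie; the key "" marks the end of a word."""
    root = {}
    for word in words:
        node = root
        for ch in word:
            node = node.setdefault(ch, {})
        node[""] = {}
    return root

def trie_to_regex(node):
    """Turn a trie node into a regex fragment matching all its suffixes."""
    branches = []
    optional = False
    for ch, child in sorted(node.items()):
        if ch == "":
            optional = True  # a word ends here, so anything after is optional
        else:
            branches.append(re.escape(ch) + trie_to_regex(child))
    if not branches:
        return ""
    if len(branches) == 1 and not optional:
        return branches[0]  # single path: no group needed
    body = "(?:" + "|".join(branches) + ")"
    return body + "?" if optional else body

words = ["romane", "romanus", "romulus", "rubens",
         "ruber", "rubicon", "rubicundus"]

# Assumed shape of the unoptimized pattern: one flat alternation.
flat = "(?:" + "|".join(re.escape(w) for w in words) + ")"

# Trie-shaped pattern: shared prefixes are matched only once.
pattern = trie_to_regex(build_trie(words))
print(flat)
print(pattern)
```

With the flat form, a line that starts with "rubicund" but is not a list item can force the engine to retry every alternative from the first letter. With the nested form, each character is a single decision, so a failing match should be abandoned much sooner.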

Posted: Wed May 07, 2008 5:19 am

I can only imagine how ugly it would be for my thousand+ strong list..

Wouldn't it be time-consuming to build the regex you speak of from the list?
Especially when the list keeps expanding?

Posted: Wed May 07, 2008 6:38 am

Well, I wouldn't be surprised if the PCRE library is already doing that kind of optimization itself, so I don't think CMUD needs to do it too. But Vijilante might have more insight into this. Remember that PCRE has a "compile" step that converts the initial regular expression into a compiled pattern. I think this is the point where PCRE is building this kind of search tree internally. CMUD keeps the trigger compiled until you change it, so this is basically already being done.

SubAdmin Joined: 18 Nov 2001 Posts: 5182

The algorithm used for regex matching actually treats a list (alternation) as an if..else chain. It is sometimes important to use the order of the items in the list to control matching, and PCRE respects that.
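A small sketch of why the order matters, again using Python's `re` module as a stand-in for PCRE (both try alternatives left to right). The two patterns are invented for this example.

```python
import re

# Alternatives are tried left to right; the first one that lets the
# overall match succeed wins, even if a later one would match more text.
short_first = re.match(r"(?:rub|ruber)", "ruber").group(0)
long_first = re.match(r"(?:ruber|rub)", "ruber").group(0)

print(short_first)  # rub
print(long_first)   # ruber
```

Any automatic rewrite of the list would need to preserve this behaviour, which is why reordering or merging items is not always safe.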

With a short set of words like the one you have shown, the gains would be small, but they would be there. With longer phrases the gains would be more noticeable, but with longer phrases you can often find common portions to group together, and making some portions optional achieves nearly the same effect. Those small gains from controlling backtracking and the match path would probably be offset by a loss of speed from the extra layers of nesting, making the net gain/loss near 0.
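Whether the nesting pays for itself is easy enough to measure. The following is only a rough timing sketch in Python; its `re` engine is not PCRE, so the numbers say nothing definite about CMUD. The sample inputs and the repeat count are arbitrary choices for this example.

```python
import re
import timeit

WORDS = ["romane", "romanus", "romulus", "rubens",
         "ruber", "rubicon", "rubicundus"]

# Flat alternation versus a hand-nested, trie-shaped equivalent.
flat = re.compile("(?:" + "|".join(WORDS) + ")")
nested = re.compile(r"r(?:om(?:an(?:e|us)|ulus)|ub(?:e(?:ns|r)|ic(?:on|undus)))")

# Mix of hits and near misses; the misses are where backtracking shows up.
samples = WORDS + ["rubicund", "romanes", "zebra"]

def run(pattern):
    """Match every sample in full and report which ones succeeded."""
    return [bool(pattern.fullmatch(s)) for s in samples]

flat_time = timeit.timeit(lambda: run(flat), number=20000)
nested_time = timeit.timeit(lambda: run(nested), number=20000)
print(f"flat:   {flat_time:.3f}s")
print(f"nested: {nested_time:.3f}s")
```

On a list this small the difference may well disappear into noise, which would fit the estimate above. A thousand-item list is where it would be worth rerunning.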

Enchanter Joined: 05 Mar 2003 Posts: 593 Location: Canada

Wizard Joined: 14 Aug 2004 Posts: 1269

So, the PCRE hasn't compiled an optimal version of