Zugg Software :: View topic - FASTER string lists and database variables in CMUD v2.0!

Posted: Mon Jul 23, 2007 10:21 pm

OK, I have some database variable benchmarks for you today.

Here are the test scripts:

Posted: Tue Jul 24, 2007 1:29 am

It would be nice if I could someday figure out what causes my scripting languages to be so slow (compared to Lua). In theory, CMUD and Lua are very similar: both use an internal compiled byte-code language. Scripts are compiled on-the-fly and then the byte-code is executed.

In the tests that I gave above, I tested my method using a trivial loop:

Posted: Tue Jul 24, 2007 1:54 am

Before making the same mistake that OldGuy2 did, I realized that I should really be comparing the Lua time with the optimized release (non-debug) times for CMUD. So, I compiled the new version using my release build script, and here are the *real* timing comparisons between the 1.34 release and the 2.00 release:

String Lists:

v1.34 makelist: 578 ms
v2.00 makelist: 656 ms
v2.00 makelist: 47 ms (with beginupdate/endupdate)

v1.34 searchlist: 359 ms
v2.00 searchlist: 265 ms
v2.00 searchlist: 47 ms (after using #SORT list command)

Database variables:

v1.34 makedb: 1015 ms
v2.00 makedb: 4093 ms
v2.00 makedb: 63 (with beginupdate/endupdate)

v1.34 searchdb: 360 ms
v2.00 searchdb: 31 ms

This is on an AMD Sempron 3100+ (1.80 Ghz) system with 2.0GB of RAM.

I'm a little confused about the speed *decrease* in the "makedb" alias on v2.0. I'm not sure why the basic routine without beginupdate/endupdate is so much slower. It must be the overhead in adding values to the hash table, so I'll need to look a bit closer at this.

Anyway, looks like it's a factor of 10 improvement, and not a factor of 100.

Novice Joined: 26 Apr 2007 Posts: 44

Do the data records in Cmud 2.00 support lookups in relatively constant time?

Posted: Tue Jul 24, 2007 12:49 pm

Posted: Tue Jul 24, 2007 4:28 pm

Yes, there are several things I can do to implement better timing...but it's never been a big enough issue to worry about in CMUD. When I care about real time profiling, I have a Delphi profiler tool that I use for serious stuff. The above stuff is good enough for basic comparisons.

Forren: I'm not sure what you mean...I've already said that CMUD 2.0 is using a hash table for the database variables, so doesn't that already mean that lookups are done in relatively constant time?

Also, before anyone gets confused, none of these changes have anything to do with the "Database Module" that is in CMUD. This module still doesn't use SQL and still hasn't been rewritten. This is a big job and won't be done for several months. The above changes only pertain to using database variables within zScript, such as using #ADDKEY, %db, %iskey, etc. I haven't even tested the existing database module yet to see if it still works with the above changes to the hash table.

Novice Joined: 26 Apr 2007 Posts: 44

Wizard Joined: 17 Jun 2006 Posts: 1201

Hehe :-) Leave it to me to screw something up. It is odd that the times actually increase on makelist and makedb without the beginupdate and endupdate.

If I am just adding items to an afflictions stringlist one at a time through an alias, would it even be beneficial to use beginupdate and endupdate and based upon this slight decrease in speed will my system run slower since this is how I track afflictions?

Before anyone jumps on me, I never said I was an expert. I'm just trying to understand. Thanks.

Posted: Wed Jul 25, 2007 2:25 am

It'll probably be beneficial to use #nosave since you probably won't log off with any afflictions, but it seems like #beginupdate and #endupdate are for when you're doing a lot of updates all at once. It might benefit if you're adding more than one affliction with the same trigger or alias, but other than that it'll probably cause problems because you might try curing an affliction without #endupdating (see this post). Perhaps you could do something where you always #endupdate the variable before you try pulling a value from it to cure anything and you might see a speed benefit there, but it's quite a complex problem. It'd probably need testing to see how it'd work.

Regardless, if you're worried about the speed decreases because you use database variables, you might find it beneficial to switch to string lists. It's probably worth testing all this stuff with your particular scripts.

Wizard Joined: 17 Jun 2006 Posts: 1201

Well I pretty much figured it was for adding a large amount of updates, but wondered about in a case like you pointed out where it adds a few at once. Some triggers do add about 3 afflictions at once, but I add them to the afflictions stringlist with a simple afflict alias such as afflict a, afflict b, afflict c and so on. Then to cure them I use #switch and %ismember to run through the list and cure. I am sure there is a better way to do it. However, I am not expert and this seemed the easiest way to me. So basically in my case I wouldn't really get any benefit out of it. By the way, I don't use a database just a simple stringlist. I was going to use databases but they seemed to be much slower originally.

With the way some classes of characters are in the MUD I play they afflict at tremendous speed so any speed benefit I can find I try to use it. Thanks for the response Fang.

Posted: Wed Jul 25, 2007 3:12 am

It depends how much you want to limit your usage of your afflictions stringlist I guess - if the ONLY time you're accessing it is when you cure something, you might see a benefit from #beginupdating the variable as soon as you log in and then using #endupdate before your #switch command goes through to find something to cure (and then it'll #beginupdate again). But this'll cause problems if you wanted to use the afflictions variable for anything else - I used various different real-time displays of the afflictions I had, either using buttons or a window and the #clr command, or just printing a list into the main MUD window every time the list changed. Stuff like that would need to keep #endupdating so often I doubt there'd be much benefit.

Posted: Wed Jul 25, 2007 4:53 pm

I think that I've shown that database variables are always going to be faster than string lists in the 2.0 version. The only time a string list can equal the speed of the database variable is when the string list is sorted. The extra time involved in adding keys to a database variable compared to a string list isn't that much in practice, and remember that adding an item to a sorted string list will also take a bit longer (since it needs to get inserted into the correct spot in the list instead of just added to the end).

Think about a string list as a special case of a database variable. In fact, in v2.0, database variables are now displayed in human readable format as a string list:

Wizard Joined: 14 Aug 2004 Posts: 1269

Sounds really cool Zugg! I had noticed that searching string lists was pretty slow in zMUD, and so hadn't made such heavy use of it, but now it opens up more possibilities! #NOSAVE, #BEGINUPDATE/#ENDUPDATE are going to really useful for the intense processing stuff too. Keep it up! Very Happy

Wizard Joined: 14 Aug 2004 Posts: 1269

Posted: Thu Jul 26, 2007 10:40 pm

OK, I think I've got the new cache working. It only recomputes the string for a stringlist or database record when it's needed. This gives a very large speed boost to string lists and database variables. Here are some numbers:

makelist (before cache): 16500 ms
makelist (cache added): 96 ms
makelist (#NOSAVE): 94 ms
makelist (#BEGINUPDATE): 94 ms

searchlist: 250 ms
searchlist (#SORT): 47 ms

makedb (before cache): 28700 ms
makedb (cache added): 188 ms
makedb (#NOSAVE): 187 ms
makedb (#BEGINUPDATE): 187 ms

searchdb: 94 ms

So now that the cache has been added, it doesn't really matter if you use #NOSAVE or #BEGINUPDATE/#ENDUPDATE. You will always get the speed improvement anyway. In theory, the #NOSAVE and #BEGINUPDATE *should* be slightly faster because they are not causing any database update. But you can see that the database update isn't really a big issue, at least in the above test.

It's possible that this new cache might have bugs that could cause CMUD to think that your stringlist or database variable is empty. I've implemented this cache at a very low level, so the change *should* be transparent to the rest of CMUD. But because it's a low-level change, there might be some weird side effect. So far it's looking pretty good though. Just something to keep an eye out for in the beta testing period.

Posted: Fri Jul 27, 2007 5:06 pm

OK, now that the internal cache is working better, I'm not sure there is any need for #BEGINUPDATE/#ENDUPDATE anymore. It no longer makes any difference with the string lists and database records. So I think I'm going to remove these commands from the 2.0 version that I'm working on.

I'll keep the #NOSAVE option. Turns out that #NOSAVE doesn't have much effect on "makelist" and "makedb" because of the internal cache...the new value of the string list or database record isn't saved until the background save thread needs it. But I found another script that shows the difference with #NOSAVE on normal variables. Here is the test script:

Posted: Fri Jul 27, 2007 6:17 pm

With the internal cache working better I agree it's a good thing to remove the BEGINUPDATE/ENDUPDATE.

I'm not so sure about the #NOSAVE though. I guess someone else would be able to make use of it.

Posted: Mon Jul 30, 2007 3:11 am

I was just thinking about the new #sort command. Does this use the new cache as well, only sorting when you access the variable rather than when you update it?

Posted: Mon Jul 30, 2007 5:32 pm

Not really. When you use the #SORT command, the "sorted" property of the internal string list is set. When a string list is marked as sorted, adding an item to it causes it to get inserted into the proper location to maintain the sort. So adding items to a sorted string list is slower. When a string list is marked as sorted, it also uses a binary search method for locating items in the list (which is why %ismember is faster).

If you are going to add a lot of items to a string list, it's best to turn off sorting, then add all of the items, then turn sorting back on:

Newbie Joined: 25 Jun 2007 Posts: 6

What's the ETA on 2.0 anyway? *hides*

Posted: Tue Jul 31, 2007 11:45 pm

Answered this back on the first page :)