Topic: FotF: Dwarf Fortress 40d19 (Read 163289 times)

zxcvmnb · « **Reply #300 on:** March 15, 2010, 07:00:49 pm »

Well, it would be quite nice to be able to modify the raws

Also, I presume that the different versions would create their own $HOME (or $HOME/.df) folders, especially as "making DF properly installable is something that will be done after not too long," presumably while there are still testing builds around and utilities which don't work with them.

bombcar · « **Reply #301 on:** March 15, 2010, 09:32:16 pm »

In theory, it wouldn't be to hard to make DF take an MD5sum or similar of the RAWs and even warn you if something is out of date when trying to load a save.

But this is all fancy-stuff; it's not necessary; but a bit of it, done right, could make life simpler for newer players.

peterix · « **Reply #302 on:** March 16, 2010, 02:19:18 am »

Please, just don't break the 'portable' part:

Spoiler (click to show/hide)

Baughn · « **Reply #303 on:** March 16, 2010, 04:17:51 am »

Relax, I wasn't going to.

vyznev · « **Reply #304 on:** March 16, 2010, 09:59:23 am »

Quote from: koitsu on March 14, 2010, 02:06:27 pm

Quote from: Linknoid on March 14, 2010, 11:44:39 am
Notice how "x * dimy * 4 + y * 4" is used over and over. My initial code was full of stuff like that as well. But probably half my optimization gains came from simply calculating values like that into a local variable ahead of time:
Whatever compiler you're using is amazingly horrible if what you said is indeed true. Compilers are supposed to do *exactly* what you describe on their own. You'd have to post the assembly code associated with the C code piece as proof. I'm pretty certain that if you're using Intel, Microsoft (VC), or gcc 3.2.x+ that the code in question would be optimised to do what you describe (very likely shoving the contents of the "base" calculation into a register if possible/available, otherwise into a temporary variable). This is, of course, assuming you're building with -O or -O2. And likely with gcc, there's probably an -f argument which affects the optimisation behaviour in cases like this (can't be bothered to look it up).

The relevant -f option here may well be -fstrict-aliasing. The problem is that if, say, any of x, y and dimy are globals and screen is a pointer, then the compiler cannot be entirely sure that

Code: [Select]

screen[x * dimy * 4 + y * 4] = 0;will not change the values of x, y or dimy (since screen might point to the same area of memory they occupy), which means it will have to refetch them from memory the next time they're used. In a tight loop, this can really kill performance.

(Rest of longish post spoilered for brevity, feel free to skip.)

Spoiler (click to show/hide)

A partial solution, as I noted, is to turn on strict aliasing, which lets the compiler make more assumptions about how arrays can overlap with each other (and with other variables); specifically, strict aliasing basically says that (quoted from Wikipedia) "it is illegal (with some exceptions) for pointers of different types to reference the same memory location." Unfortunately, strict aliasing has two drawbacks: first, it can easily break otherwise valid code not written with such rules in mind, and second, it won't help at all if the variables involved in fact do have the same type (for example, if x is an int and screen is an int *).

Another solution is the C99 restrict keyword, which basically extends strict aliasing by letting you promise to the compiler that the area of memory referenced through a particular pointer won't overlap anything else. In Linknoid's example, as well as mine, if screen had been declared as, say, int * restrict screen, then the problem would go away entirely. Of course, it might be replaced by a bigger problem if one did have another pointer, used in the same function, pointing to the same area of memory: then the promise given by the restrict keyword would be violated, and broken code could be generated.

A third approach is to follow the "load-use-store" paradigm (which is basically what Linknoid did): always load any individual values you might be using more than once into local variables (which the compiler knows won't be aliased, and which it will usually be able to store in registers), preferably before any speed-critical loops. Then do whatever you need to do using those variables and, if necessary, store them back into their original locations at the end. This is essentially a manual way of doing the kind of optimizations the compiler could do if you used strict aliasing and restrict diligently, but the manual approach also works with strict aliasing turned off; and since it's your code and you know what it's supposed to be doing, you can sometimes make such optimizations even in cases where it would be extremely difficult or impossible to coax the compiler into doing the same thing.

Also, IMHO using the "load-use-store" style even when it might not be strictly necessary is often good practice; it rarely if ever hurts performance, since modern compilers are also very, very good at optimizing register usage, and with well chosen variable names it can make your code cleaner and more readable. Also, I've seen many programmers (including myself, once or twice) get bitten by simple aliasing and/or variable reuse bugs (in the simplest cases, just trivial things like

Code: [Select]

struct node *pop_first (struct node **list) {
    if (*list) *list = (*list)->next;
    return *list;  /* oops, should've returned the _original_ value */
}

) that could've been easily avoided just by always using copious temporary variables and letting the compiler decide when those variables are no longer needed. It makes your code faster, cleaner and safer; what's not to like?

Baughn · « **Reply #305 on:** March 16, 2010, 11:42:25 am »

I.. see. That's kind of disturbing.

Okay, point taken. I'll make sure to do that in performance-critical code, then.

Baughn · « **Reply #306 on:** March 16, 2010, 06:15:57 pm »

Yep. Careful micro-management cut the critically important graphicst::display function's cost by 4x.

I am most disturbed. Compilers are supposed to be smarter. Like GHC.

bluea · « **Reply #307 on:** March 16, 2010, 07:53:02 pm »

Compilers are Dwarves with the beards abstracted away.

They can pull completely amazing maneuvers, yet manage to die of thirst in the river.

Rafal99 · « **Reply #308 on:** March 17, 2010, 11:01:24 am »

Yeah I noticed some time ago that there parts of the code which can be possibly easily optimized.
Unfortunately I couldn't get the whole stuff to compile, and also my tests with small parts of code called separately in the long loop didn't give me any significiant results, so I gave up.

Now the code in github in file graphics.cpp is as bad as then,
except that instead of screen[x ][y][3] it is screen[x2*dimy*4 + y2*4 + 3]...
The compiler may optimize it or may not, I wouldn't count of it.
Adding some local variables here and there is a quick task and requires very little effort. Imo it is better to do it just to be sure it is efficient.

Also things like this:

Spoiler (click to show/hide)

It could be better to make a simple struct and copy one pointer instead of three.
Since all this code is inside a nested loop I would try to get as much optimization as possible.

Edit: Just noticed this code is inside the graphicst::display you posted about, it seems I was right

random51 · « **Reply #309 on:** March 17, 2010, 11:02:07 am »

Quote from: Baughn on March 16, 2010, 06:15:57 pm

Yep. Careful micro-management cut the critically important graphicst::display function's cost by 4x.

I am most disturbed. Compilers are supposed to be smarter. Like GHC.

While you're disturbed, can we throw a party assuming 40d20 will be faster?

Andir · « **Reply #310 on:** March 17, 2010, 11:12:02 am »

Quote from: random51 on March 17, 2010, 11:02:07 am

Quote from: Baughn on March 16, 2010, 06:15:57 pm
Yep. Careful micro-management cut the critically important graphicst::display function's cost by 4x.

I am most disturbed. Compilers are supposed to be smarter. Like GHC.

While you're disturbed, can we throw a party assuming 40d20 will be faster?

That would depend if you have idle folks here and a meeting hall assigned...

PencilinHand · « **Reply #311 on:** March 17, 2010, 12:23:04 pm »

Quote from: Andir on March 17, 2010, 11:12:02 am

Quote from: random51 on March 17, 2010, 11:02:07 am
Quote from: Baughn on March 16, 2010, 06:15:57 pm
Yep. Careful micro-management cut the critically important graphicst::display function's cost by 4x.

I am most disturbed. Compilers are supposed to be smarter. Like GHC.

While you're disturbed, can we throw a party assuming 40d20 will be faster?
That would depend if you have idle folks here and a meeting hall assigned...

Well, I am idle, and I have a table over here.... Now where is that Strawberry wine.

kuaera · « **Reply #312 on:** March 17, 2010, 12:52:32 pm »

Just an FYI: the linux version links to libjpeg.so.8, and only libjpeg.so.62 is available on my system; making a symbolic link from libjpeg.so.8 to libjpeg.so.62 appears to work without problems, but linking to libjpeg.so would probably allow those of lesser troubleshooting skills on linux to use the game.

Baughn · « **Reply #313 on:** March 17, 2010, 01:52:15 pm »

The code under active development is in the "matrix" branch of the git repository; you need to look at that if you want to see how it actually works.

And yes, 40d20 should be slightly faster.

vyznev · « **Reply #314 on:** March 17, 2010, 02:31:46 pm »

Quote from: PencilinHand on March 17, 2010, 12:23:04 pm

Quote from: Andir on March 17, 2010, 11:12:02 am
Quote from: random51 on March 17, 2010, 11:02:07 am
While you're disturbed, can we throw a party assuming 40d20 will be faster?
That would depend if you have idle folks here and a meeting hall assigned...
Well, I am idle, and I have a table over here.... Now where is that Strawberry wine.

Fun off-topic fact: If you let your dwarves do nothing but hang around in a fancy meeting hall and hold parties all the time, the socialization will keep them so happy that they'll be permanently ecstatic even if you don't give them any booze at all! Or beds or chairs or tables, for that matter.

(In fact, my recent experiences suggest that if a bunch of idle dwarves run out of both booze and water, they may still continue happily slacking off together until they drop dead of dehydration, still ecstatic. I haven't confirmed this in a controlled experiment yet, though. Also, the last ones to survive might eventually become unhappy after enough of their friends have croaked.)

After a few years of non-stop partying they'll also end up superdwarvenly strong, tough and agile. The exact mechanism by which social interaction causes them to gain these stats is unknown, but it seems reasonable to presume that it must involve more than just standing around and talking.

...we now return to your scheduled, um, programming.

News:

Poll

Having tested both 2D and STANDARD, how is 40d19 compared to 40d?

Author Topic: FotF: Dwarf Fortress 40d19 (Read 163289 times)

zxcvmnb

Re: FotF: Dwarf Fortress 40d19

bombcar

Re: FotF: Dwarf Fortress 40d19

peterix

Re: FotF: Dwarf Fortress 40d19

Baughn

Re: FotF: Dwarf Fortress 40d19

vyznev

Re: FotF: Dwarf Fortress 40d19

Baughn

Re: FotF: Dwarf Fortress 40d19

Baughn

Re: FotF: Dwarf Fortress 40d19

bluea

Re: FotF: Dwarf Fortress 40d19

Rafal99

Re: FotF: Dwarf Fortress 40d19

random51

Re: FotF: Dwarf Fortress 40d19

Andir

Re: FotF: Dwarf Fortress 40d19

PencilinHand

Re: FotF: Dwarf Fortress 40d19

kuaera

Re: FotF: Dwarf Fortress 40d19

Baughn

Re: FotF: Dwarf Fortress 40d19

vyznev

Re: FotF: Dwarf Fortress 40d19