K 10 svn:author V 3 bde K 8 svn:date V 27 2008-02-19T12:54:14.000000Z K 7 svn:log V 387 Rearrange the polynomial evaluation for better parallelism. This saves an average of about 8 cycles or 5% on A64 (amd64 and i386 -- more in cycles but about the same percentage on i386, and more with old versions of gcc) with good CFLAGS and some parallelism in the caller. As usual, it takes a couple more multiplications so it will be slower on old machines. Convert to __FBSDID(). END