VFPU math lib

evilo · Post by **evilo** » Wed Jun 20, 2007 8:51 am

see http://forums.ps2dev.org/viewtopic.php?t=7539

Do we have a similar lib on the PS2 ?

Too bad that we don't have closer SDK between the PS2 and the PSP (seeing how much the PSP scene is active comparing to the PS2 one).

Post by **ooPo** » Wed Jun 20, 2007 10:22 am

Its a lot easier to set up a PSP for development, and its new and shiny so more people are interested in it.

You can use vu0 in macro mode and get similar results. Take a look at the math3d code in ps2sdk for an example.

evilo · Post by **evilo** » Wed Jun 20, 2007 4:57 pm

the toolchain script is almost the same, and apart the fact that you need a way to run unsigned code on the PS2, it's not difficult!

anyway, as soon as I will get my PS2 back (that is still travelling across europe), I'll have a closer look at it, I have a few things that would benefits of such optimizations.

evilo.

Lukasz · Post by **Lukasz** » Wed Jun 20, 2007 11:49 pm

You can also look for inspiration in some of the VU0 macro code in libito:

http://svn.ps2dev.org/filedetails.php?r ... rev=0&sc=0
http://svn.ps2dev.org/filedetails.php?r ... rev=0&sc=0

Post by **emoon** » Fri Jun 22, 2007 5:40 am

If you really want to do math calculation in an efficient manner on the PS2 you need to use vu0 micromode.

What that means is that you do something like this (a basic mul example)

Code: Select all

Vu0 code &#40;or something like this&#41;

               mul       vf01, vf02, vf03  nop
               nop&#91;e&#93;                      nop

And then the ee code:

Code: Select all


..upload MyMathFunction to Vu0 goes here..

Then&#58;

// set some input registers

	__asm__ volatile &#40;"lqc2	vf02,0x00&#40;%0&#41;\n" &#58; &#58; "r" &#40;&myValue1&#41; &#58;  "memory"&#41;;
	__asm__ volatile &#40;"lqc2	vf03,0x00&#40;%0&#41;\n" &#58; &#58; "r" &#40;&myValue2&#41; &#58;  "memory"&#41;;

// calls the vu0 program &#40;start from address 0 in vu0&#41;

	__asm__ volatile &#40;"vcallms 0\n"&#41;;

// do something on the ee while vu0 is calculating...

// ...

// Get the result from vu0 &#40;3 vnops to make sure the calulation has finished if it takes 4 cycles&#41;

			__asm__ volatile &#40;"vnop\n"&#41;;
			__asm__ volatile &#40;"vnop\n"&#41;;
			__asm__ volatile &#40;"vnop\n"&#41;;
			__asm__ volatile &#40;"sqc2	vf01,0x00&#40;%0&#41;\n" &#58; &#58; "r" &#40;&myResult&#41; &#58;  "memory"&#41;;

Sure to use vu0 in macromode (similar to vfpu on psp) you get better performance than not using vu0 at all.

But infact its even faster to use vu0 micromode even if you block on it directly afterwards because you dont need to fetch any extra instructions to the I-cache and vu0 just runs more efficient in micromode.