Super fast addition?

curt2go · Joined: 21 Nov 2003 Posts: 200

I know the title may be a bit illusive. But what I am doing is I need to process 512 words of info in a loop.

Processor 24EP256GP206 running at 140MHz

Right now I am not running a loop because writing out the individual lines is way faster.

variable[0] += variable2[0];

That is what I am doing 512 times. variable and variable2 are global because they are used elsewhere. To do this 512 times takes 23uS. If it is done in a for loop it takes 175uS. But the issue is that I have to process variable2 before it gets added. I need to add a factor to it. Now these variables are signed int16's. I need to factor them like this:

float factor = 1.14;
float result;
result = variable2[0] * factor;
variable[0] += result;

The factor will change all the time and this process gets done all the time. Every 23mS to be exact. The above statement took like 7mS to do 512 times or something, i cant remember exactly, to process. Which is way, way too long. Now would changing it to a signed INT32 then making the factor an INT8 or something be way better? Or even using pointers?
Ideally i would like to be under 200uS.

I thought I would ask here because you guys are phenomenal at knowing which way is the best. Thank you in advance.

curt2go · Joined: 21 Nov 2003 Posts: 200

I found the post where PCM shows how to use the simulator in MPLAB. I'm going to run a bunch of simulations in the meantime. I also read in another thread where there were some options about using int32 but only using the top bytes. Not sure I understand that. Any light would help. I need this to be as fast as possible.

Sorry that this is a dumb question anyways.

curt2go · Joined: 21 Nov 2003 Posts: 200

Ok this is where I am at with the SIM. Normally I would just use floats cause i dont care about speed and thus I am having issues with the math and such here.

Let me know what you think. The math does not seem to be coming out right. At least on the sim.

temtronic · Posted: Mon Jul 16, 2018 2:29 pm

comment.
you should post a few of the results that you get vs what you expect, as well as the interim values....

Jay

curt2go · Joined: 21 Nov 2003 Posts: 200

I put the actual SIM numbers beside each line. I really was hoping for under 200uS but it looks like that is not even possible. The big one is converting from the int16 to int32 for the math. I also will have to do some error checking in each one as i cant go over 32767 or under 32767. I only have the positive error check in there right now.

I also need to figure out how to do the math with negative numbers because I cant do a bit shift like I am with the positive numbers. Unless I am missing something.

I have to run this routine 3 times for each 23mS between SD card reads. I also have alot of other stuff going on so that is why the shorter the better. I am using the DMA to write to the codec so that is not getting in the way at all now.

So any suggestions would be awesome. I can try them on the SIM to see pretty quick.

curt2go · Joined: 21 Nov 2003 Posts: 200

Here is the latest with handling negative numbers as well. This one is 944uS.

Let me know if you can see any efficiencies.

temtronic · Posted: Mon Jul 16, 2018 4:59 pm

just an idea...
instead of this...

*** if(variable2[i] < 0)
*** neg = 1;
w >>=8; // 0.1uS
*** if(neg)
*** w = 0xFFFFFFFF - w;//turn it back negative again if it was.
w +=variable[i];//0.2uS
if(w > 32767)//0.27uS
w=32767;//0.07uS
if(w < -32767)//0.27uS
w = -32767;//0.07uS

if(variable2[i] < 0)
w = 0xFFFFFFFF - w;//turn it back negative again if it was.

w >>=8; // 0.1uS

w +=variable[i];//0.2uS
if(w > 32767)//0.27uS
w=32767;//0.07uS
if(w < -32767)//0.27uS
w = -32767;//0.07uS
variable[i] = w;//0.07uS

Only the first statement after an IF() gets executed, so my thinking is you can eliminate the settting of the neg variable and the later test.
If I'm correct it should speed up the overall process.
If I'm wrong, well, it's 90*F in the shade and drier than the desert here, sorry, my brain's fried !
Jay

PCM programmer · Joined: 06 Sep 2003 Posts: 21708

curt2go · Joined: 21 Nov 2003 Posts: 200

Yeh. The version i use now is all unrolled. Its takes up more ROM but i have that space to do so. The biggest ones are the converting to INT32 i do see. But not sure how I can get around that as I need the larger numbers instead of using floats. I will take a look more into the LST file and see where I can do some stuff.

And Temtronic that is a good idea. I will try that one. Since the data is probably half negative only It will cut down the time on the negative portion for sure.

Ttelmah · Joined: 11 Mar 2010 Posts: 19546

Be aware:

w >>=8; //

Only gives /256, for a +ve number. Not -ve.

Look at:
<https://en.wikipedia.org/wiki/Arithmetic_shift>
Look at the section on 'Non-equivalence of arithmetic right shift and division'.

Don't do scaling like this.

If you want to use an integer factor, use int32 arithmetic. Multiply the factor by 65536, rather than 256. Then take the upper two bytes of the result as the int16 value. This can be done efficiently using a union.

curt2go · Joined: 21 Nov 2003 Posts: 200

That's is a very cool and efficient solution!

It saves some time which is awesome.

But what might be the best way to check for min and max values doing the math this way? I need to add variable[x] += variable2[x]; But the min and max is to be 32767 and -32767.

One weird thing in the simulator the math is always coming out double.
For instance if i use -1000 and do the math with 0.14*65536 as the scale the math should come out with -140 as the answer. But it is always double in this case its -280. I have just assumed its something in the SIM? Any thoughts?

This is the new math in the SIM it is cutting out 200uS so far.

Ttelmah · Joined: 11 Mar 2010 Posts: 19546

I'd be worried about this:

value.wrapper = value.parts[1];

Remember value.parts, is 'part' of wrapper. This is putting part of a number back into the same RAM area. No idea quite what the effect would actually be!... Suspect the compiler may be having a hiccup on this which is resulting in the doubling.

curt2go · Joined: 21 Nov 2003 Posts: 200

It was doubling before I was using this math. It does the same thing here.

variable2 = 6000;

6000 *0.14 = 840 but it comes out with 1679

curt2go · Joined: 21 Nov 2003 Posts: 200

If i use 32768 in the scale then I come out with the right number.

Ttelmah · Joined: 11 Mar 2010 Posts: 19546

Just stuck a basic program together and run it up in a different PIC, and it works fine: