times ten in fewest CPU cycles?

asmboy · Joined: 20 Nov 2007 Posts: 2128 Location: albany ny

I need to do the fastest possible multiplication by ten for unsigned 16 bit INTS with a value range of 0-4095 ( output of 12bit ADC) and make process decisions as rapidly as i can after getting a new reading.

over the range of input values ( 0-4095) , code for *10 can profile out to
a few hundred cpu cycles. so i created a dedicated function for this that does what i need deterministically, irregardless of input ,
in a constant 25 instructions.

every cycle counts in this application and
what i'm not sure of is if it can be done more efficiently yet?

Anybody?

temtronic · Posted: Tue Nov 12, 2013 4:18 pm

While slowly reading your request, I was thinking shift and add twice..hmmm..scolled down to your function and it's the same !!
I don't think there is a faster way to do it

So great minds do think alike !!

hth
jay

dmendesf · Joined: 31 Dec 2005 Posts: 32

The fastest way I can think is to trade space for speed:

dmendesf · Joined: 31 Dec 2005 Posts: 32

You can also trade less space for less speed using this:

k = a*256+b // b from 0 to 255, a from 0 to 15

10 * k = 10*a*256 + 10*b

asmboy · Joined: 20 Nov 2007 Posts: 2128 Location: albany ny

nice ideas above.

1-having less than fastest speed is a nogo,

2- the pic16 chip in question (selected for its unique hardware functions)
only has 8k worth of 'word space' Very Happy

and a table of 4096 'words' of int16 data
leaves appx zero code space for the rest of the program. Sad

the table lookup would surely be the bomb if i was using an 18f46k80,
which Sad

, i am not.

dmendesf · Joined: 31 Dec 2005 Posts: 32

I think that the second option I posted can be faster than your approach and it needs "only" 512 words of memory. You should benchmark it and also a small variation using make8 to acess 16 bit variables... I don´t know if CCS compiler is smart enough to do all pointer arithmetic in compile time for the union... maybe the make8 is the key for it.

alan · Joined: 12 Nov 2012 Posts: 357 Location: South Africa

Hi asmboy

if you put all in the return with another shift, it shaves some instructions off on my 4.141.

Ttelmah · Joined: 11 Mar 2010 Posts: 19535

This is one of those cases where using a function is less efficient...

The compiler knows that it can perform *8, and *2, using rotations. The optimiser handles this, so:

ckielstra · Joined: 18 Mar 2004 Posts: 3680 Location: The Netherlands

Instead of using a macro I tried to #inline the function because I like the parameter type checking from functions. Too bad it seems like #inline isn't working in my v4.147.

We now only had a look at just one line of your algorithm. Multiplying a sampled value by 10 'smells' like there are more optimizations possible given a look at the total algorithm.

Ttelmah · Joined: 11 Mar 2010 Posts: 19535

4.147?....

I thought 4.141 was the last.

Anyway on a version where it does work, it still adds the extra instructions to do an extra move to an extra variable.

If one is prepared to use two lines, then the most efficient I can think of with the compiler, is:

jeremiah · Joined: 20 Jul 2010 Posts: 1353

I checked on PCD 5.013 and inlining appears to work. It's still not as efficient as the macro. I found that making the input a reference parameter shaves off the unnecessary moves, but there is still a single "extra" instruction that I assume comes from their code generation algorithm (or maybe it does have a purpose...I didn't go too deeply into it)

bkamen · Posted: Wed Nov 13, 2013 9:07 am

asmboy · Joined: 20 Nov 2007 Posts: 2128 Location: albany ny

many thanks to all who contributed.

i have to use the 16f1509 for its NCO feature , found in none of the 18f family
and also have to live with a reduced 10 bit ADC and needing to use external EEPROM as well. Sad

how microchip can release a chip with so many unique and desirable functions , yet omit EEPROM is a mystery and a frustration to this fellow.

the macro define solution is the icing on the cake!

special thanks to Mr. T for your singularly focused analysis.

temtronic · Posted: Wed Nov 13, 2013 12:13 pm

It could be that a customer wanted a special PIC with those specific features and low,low cost ,thus the '16F1509' was born.
It wouldn't be the first time a 'new product' was added to a manufacturer's line. Heck I did that decades ago with energy control products.
These days it's super easy to 'make-my-PIC', all you need is tell uChip what you need and some cash.

hth
jay