How to approximate float function by integer numbers?
3 views (last 30 days)
Show older comments
Dan Richter
on 20 Dec 2022
Commented: Walter Roberson
on 29 Dec 2022
I need to spare same space in 2 kB Flash MCU to finish with the program to control servos where x = 0.0° to 90.0°
How to approximate the function float y = x / 90 * 1250 + 3750 by an integer function, preferably using uint16_t
The integer divisor should be a power of 2.
0 Comments
Accepted Answer
Les Beckham
on 20 Dec 2022
I'm assuming you want to replace the floating point calculation with an integer one? I'm not sure how that is going to "spare some space" unless you can't do floating point calculations and you don't have room for floating point emulation in software.
There are a lot of possible ways to do that depending on details of your application which you didn't provide.
This example comes pretty close.
x = 0:0.1:90;
y = x./90*1250 + 3750;
x2 = uint16(0:90 * 111); % scale your x values by 111
y2 = x2/8 + 3750; % replace /8 with right-shift 3 places
plot(x, y, x2/111, y2)
xlabel 'x'
ylabel 'y'
legend('Float calculation', 'Integer calculation', 'Location', 'southeast')
grid on
2 Comments
More Answers (1)
Walter Roberson
on 29 Dec 2022
General exact process:
This can be done more compactly using bitand() and bitget() and similar, but sometimes it is easier to think in terms of streams of bits.
format long g
R = (1250/90)
U64 = typecast(double(R), 'uint64');
B64 = dec2bin(U64, 64);
numerator = int64(bin2dec(['1', B64(end-51:end)]))
denominator = 2^(510 + 52 - bin2dec(B64(2:11)))
sign = 2 * (B64(1)=='0') - 1
reconstructed = sign * double(numerator) / double(denominator)
R - reconstructed
Approximating with a 16 bit denominator would take more work. Or perhaps less...
1 Comment
Walter Roberson
on 29 Dec 2022
format long g
R = (1250/90)
D = 15 - ceil(log2(abs(R)));
denominator = uint16(2^D)
sign = 1; if R < 0; sign = -1; end
numerator = sign * int16(floor(abs(R) * denominator))
reconstructed = double(numerator) / double(denominator)
R - reconstructed
When you look at those, at first it looks as if it would be plausible that you could gain another bit of accuracy by using a numerator one bit different from twice as large as the existing one, so 56888 +/- 1. But if you do that then you lose the room for the numerator to be negative.
This code will not work properly for input values less than 1.
See Also
Categories
Find more on Logical in Help Center and File Exchange
Products
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!