I have a question about dealing with small probability values in machine learning models.
The standard way to avoid the underflow problems that result from multiplying small floating-point numbers is to use log(x) instead of x.
Suppose that x = 0.50, the log of which is log(x) = -0.301029996.
To recover x later on, the value of exp(log(x)) != x, that is,
0.740055574 != 0.50
So how is using the logarithm useful for dealing with underflow?
(Not at all sure I remember correctly, so please correct me if I'm wrong.)
This is not really about overflow or underflow, but about floating point precision.
The idea is that if you have many very small numbers, multiplying them will produce an extremely small number. Say you have ten probabilities of 1%, or 0.01, each. Multiply them, and the result is 1e-20. In those regions, floating-point precision is not very good, which can introduce errors. In the worst case, the number could be 'rounded' to zero, which would break the entire calculation. The trick with logarithms is that after conversion to logarithms, the multiplication becomes a sum of logs, and that sum stays in a range where floating point behaves well: each log(0.01) is -2 in base 10 (about -4.6 as a natural log), so ten of them add up to only -20 (or about -46), nowhere near the underflow limit.
Example (using Python, because I'm too lazy to fire up Eclipse, but the same works for Java):
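Something along these lines (my own minimal sketch of such an example, not the original answer's code):

```python
import math

probs = [0.01] * 200   # two hundred probabilities of 1% each

# Naive product: 0.01**200 = 1e-400 is smaller than the smallest
# positive double, so the result underflows to exactly 0.0
product = 1.0
for p in probs:
    product *= p
print(product)                        # 0.0

# Log-space version: add natural logarithms instead of multiplying
log_product = sum(math.log(p) for p in probs)
print(log_product)                    # about -921.03, easily representable
```

To compare two such products, you compare their log values directly; converting back with math.exp(log_product) would just underflow again.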
Also, as pointed out in the other answer, the problem with your particular calculation is that you seem to use a base-10 logarithm (log10 in Java), whose inverse function is not exp(x) but 10^x. Note, however, that in most languages / math libraries, log is in fact the natural logarithm.

This has nothing to do with overflow. In the first step, you compute the log in base 10 instead of the natural logarithm. You can either raise 10 to that value (10^log(x)) to get back x, or use the natural logarithm.
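For the question's concrete numbers, a quick check of both options (my own sketch, in Python for brevity):

```python
import math

x = 0.50

log10_x = math.log10(x)       # -0.30102999..., base-10 logarithm
print(10 ** log10_x)          # 0.5 -- invert a base-10 log with 10**(...)
print(math.exp(log10_x))      # 0.74005557... -- exp() is the wrong inverse here

ln_x = math.log(x)            # -0.69314718..., natural logarithm
print(math.exp(ln_x))         # 0.5 -- exp() correctly inverts the natural log
```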