Find the error condense logarithms

9/19/2023

When we take logs, it's again about two interquartile ranges below the new median. Meanwhile a low value like 30 (only 4 values in the sample of size 1000 are below it) is a bit less than one interquartile range below the median of $y$. In the case of $y$, it's 5 interquartile ranges above the median.īut when we take logs, it gets pulled back toward the median after taking logs it's only about 2 interquartile ranges above the median. When we looks at the original data, a value at the far right - say around 750 - is sitting far above the median. In the first diagram, $x$, $y$ and $z$ all have means near 178, all have medians close to 150, and their logs all have medians near 5. Taking logs "pulls in" more extreme values on the right (high values) relative to the median, while values at the far left (low values) tend to get stretched back, further away from the median. So we can imagine looking at some kind of "standardized" variables (while remaining positive, all have similar location and spread, say) Note that when we're looking at a picture of the distributional shape, we're not considering the mean or the standard deviation - that just affects the labels on the axis.

We can see that this might help at least sometimes to reduce the amount of right-skewness. If we wanted our distributions to look more symmetric, and perhaps more normal, the transformation clearly improved the second and third case. One the other hand, the most skew variable ( $z$) is still (slightly) right skew, even after taking logs. You can see that the center case ( $y$) has been transformed to something close to symmetry, while the more mildly right skew case ( $x$) is now somewhat left skew. The bottom row contains histograms for their logs. The top row contains histograms for samples from three different, increasingly skewed distributions. The economist is likely to plunge ahead anyway since what we really like about the transformation are points 1,2,and 4-7.įirst let's see what typically happens when we take logs of something that's right skew. Log-normally distributed or where logging the data does not result in the transformed data having equal variance across observations, a statistician will tend not to like the transformation very much. This, I think, is because they judge my point 8 and the second half of my point 3 to be very important. Statisticians generally find economists over-enthusiastic about this particular transformation of the data. Normally distributed data have lots going for them.

If your data are log-normally distributed, then the log transformation makes them normally distributed.
Well, at least with OLS and other related estimators. change the units of) $X$ or $Y$, it will have absolutely no effect on the estimated value of $\beta_2$. This means, on the one hand, that it has no units, and, on the other hand, that if you re-scale (i.e.

The slope coefficient, $\beta_2$, becomes scale-invariant.If $X$ is years, then the coefficient is annual growth rate in $Y$, for example.

In this case, $\beta_2$ is the growth rate in $Y$-measured in whatever time units $X$ is measured in.
If $X$ is time, again you include it without logging it, typically.
In this case, $\beta_2$ is the percent difference in $Y$ between the $X=1$ category and the $X=0$ category.
If $X$ is a dummy variable, you include it without logging it.
It is the percentage increase in $Y$ from a one percent increase in $X$.
The coefficient $\beta_2$ is interpreted as an elasticity.
As TrynnaDoStat mentions, the log-log form "draws in" big values which often makes the data easier to look at and sometimes normalizes the variance across observations.
I've drawn it with $\beta_1=0$ and $\epsilon=0$, but in a real application neither of these would be true, so that the slope and the height of the curves at $X=1$ would be controlled by those rather than set at 1. \ln$, so which can have any positive slope), a hyperbola, a parabola, and a "square-root-like" shape. We especially love it in regression models, like this:

Economists (like me) love the log transformation.

0 Comments

Find the error condense logarithms

Leave a Reply.

Author

Archives

Categories