Talk:ITP method

	This article was reviewed by member(s) of WikiProject Articles for creation. The project works to allow users to contribute quality articles and media files to the encyclopedia and track their progress as they are developed. To participate, please visit the project page for more information.
C	This article has been rated as C-class on Wikipedia's content assessment scale.
	This article was accepted from this draft on 4 January 2021 by reviewer Eumat114 (talk · contribs).

Mathematics C‑class Low‑priority

	This article is within the scope of WikiProject Mathematics, a collaborative effort to improve the coverage of mathematics on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
C	This article has been rated as C-class on Wikipedia's content assessment scale.
Low	This article has been rated as Low-priority on the project's priority scale.

How to tune kappa parameters?

One of the weak spots in this article is how to tune the $\kappa$ parameters, particularly $\kappa _{1}$. The original paper discusses the question a bit, but doesn't itself give any clear guidance. After a bit of experimentation (mostly on inverse arc length problems), I'm pretty confident that $\kappa _{2}=2$ is a reasonable default choice, suitable in most cases unless experiment or analysis strongly suggests a different value. But $\kappa _{1}$ is a bit of a different question. The current example in this article uses a value of $0.1$, which is the same numerical value as the examples in the paper, but I think that fact is a bit misleading. The examples in the paper have $b-a=2$, while the example here has $b-a=1$, and there is scaling involved. If $\kappa _{2}=2$, then I think the heuristic that captures the intent of the paper is $\kappa _{1}=0.2/(b-a)$, and this is what I have documented in the kurbo implementation. Also note, if the example were to adopt this recommendation, then $n_{0}=0$ would work with fast convergence; I think you can say that you need more "slack" as provided by $n_{0}$ when $\kappa _{1}$ is lowballed.

On that last point, I feel like I might be skating a little close to the edge of WP:NOR here. I personally consider an open source project such as kurbo a reasonably reliable source, but can see how others may feel differently. Ideally there would be some survey paper or textbook chapter that we could cite, but, partly due to the technique being so new, there's precious little out there. I'm going to shamelessly self-promote in the interest of providing useful resources to readers, but I'm open to hearing better ways to handle this. Raph Levien (talk) 16:19, 5 April 2021 (UTC)

Raph Levien, I have added a small note to link to your library (I *feel* that's compliant with NOR). Cheers 🦀🍺. --Artoria 2e5 🌉 12:56, 4 July 2021 (UTC)

Oh, while we are at it, I feel the article has a gap in its description of $n_{0}$. The paper (and this Wikipedia article) only mentions that it's ≥ 0, while your documentation seems to suggest a smaller useful range of [0,1], further constrained by the optimization to use usize? --Artoria 2e5 🌉 13:13, 4 July 2021 (UTC)
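
For anyone who wants to experiment with the heuristic discussed in this thread, here is a minimal Python sketch of the ITP iteration. It is not the kurbo code and not a reference implementation. The function name `itp`, the cubic test function, the bracket $[1,2]$, and the tolerance are illustrative assumptions and are not taken from this discussion; only $\kappa _{1}=0.1$, $\kappa _{1}=0.2/(b-a)$, $\kappa _{2}=2$, and $n_{0}\in \{0,1\}$ come from the comments above. The sketch assumes $f(a)<0<f(b)$ and prints the iteration counts so that the effect of each setting can be inspected rather than taken on trust.

```python
import math


def itp(f, a, b, eps, kappa1, kappa2=2.0, n0=1):
    """Sketch of the ITP iteration; assumes f(a) < 0 < f(b).

    Returns (estimate, number_of_iterations).
    """
    ya, yb = f(a), f(b)
    if not (ya < 0.0 < yb):
        raise ValueError("expected f(a) < 0 < f(b)")
    # Bisection iteration count, plus the extra slack n0.
    n_half = math.ceil(math.log2((b - a) / (2.0 * eps)))
    n_max = n_half + n0
    j = 0
    while b - a > 2.0 * eps:
        x_half = 0.5 * (a + b)
        r = eps * 2.0 ** (n_max - j) - 0.5 * (b - a)
        delta = kappa1 * (b - a) ** kappa2
        # Interpolation: the regula falsi point.
        x_f = (yb * a - ya * b) / (yb - ya)
        # Truncation: move x_f toward the midpoint by delta.
        sigma = math.copysign(1.0, x_half - x_f)
        x_t = x_f + sigma * delta if delta <= abs(x_half - x_f) else x_half
        # Projection: stay within distance r of the midpoint.
        x_itp = x_t if abs(x_t - x_half) <= r else x_half - sigma * r
        y_itp = f(x_itp)
        if y_itp > 0.0:
            b, yb = x_itp, y_itp
        elif y_itp < 0.0:
            a, ya = x_itp, y_itp
        else:
            a = b = x_itp
        j += 1
    return 0.5 * (a + b), j


def cubic(x):
    # Illustrative test function (an assumption, not from this thread).
    return x**3 - x - 2.0


a0, b0, eps0 = 1.0, 2.0, 0.0005
for kappa1 in (0.1, 0.2 / (b0 - a0)):
    for n0 in (0, 1):
        root, iters = itp(cubic, a0, b0, eps0, kappa1, 2.0, n0)
        print(f"kappa1={kappa1:.2f} n0={n0} root={root:.6f} iterations={iters}")
```

Because the projection step works right at the edge of the bisection bound when $n_{0}=0$, floating-point rounding can cost one extra iteration in such runs, so the printed counts are best read as indicative.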

Observations from tuning of n0, epsilon, and k1

As the WP:NOR policy does not apply to Talk pages, I think it's OK to share some yet-unpublished recommendations here: I have conducted a quasi-Monte Carlo parametric study of $\epsilon$ and $n_{0}$, using two alternative monotonic curves of the form $y=x^{((1/c)-1)}$ and $y=1-(1-x)^{(c/(1-c))}$, where $c$ is a randomized constant in the range $c\in (0,1)$, and $x$ is in the range $x\in (0,1)$. Simulations were conducted in Visual Basic (machine floating-point epsilon = 1E-15). My findings are as follows:

Tuning of epsilon

I tried values 1E-16 $\leq \epsilon \leq$ 0.1, in steps of factor 10.
As $\epsilon$ is reduced, the number of iterations increases and precision improves. However, this effect levels out for small values of $\epsilon \leq$ 1E-10.
For large values of $\epsilon$ : 1E-9 to 0.1, there is little benefit of this algorithm compared to basic Bisection, both in terms of number of iterations, and precision of $x_{ITP}$ .
For small values of $\epsilon$ : 1E-16 to 1E-10, there is negligible difference in performance, both in terms of number of iterations and precision of $x_{ITP}$ . It levels out, with median 8 iterations.
Therefore I recommend setting $\epsilon$ equal to the machine epsilon (1E-15 in my case), as there are no drawbacks compared to using lower precision.

Tuning of n0

I tried values $0\leq n_{0}\leq 20$ in steps of +1, and up to 50 in steps of +10.
$n_{0}=0$ is identical to Bisection.
$n_{0}=1$ halves the number of iterations compared to Bisection, and improves the precision of $x_{ITP}$ (median 100000 times more accurate, and 3rd quartile 30 times more accurate), assuming small $\epsilon \leq$ 1E-10.
Increasing $n_{0}$ further (i.e. $n_{0}=2$ and higher) leads to further improvements in precision (median and Q3 statistics) with the added benefit of reducing the mean number of iterations (median and Q3 remain static). The effect levels out exponentially, with little improvement beyond, say, $n_{0}$ = 6 to 8.
The maximum possible number of iterations increases with $n_{0}$: Max iterations = $n_{bisection}+n_{0}$. However, the likelihood of this happening decreases for higher $n_{0}$, so the mean number of iterations actually decreases for higher $n_{0}$, but the effect levels out exponentially beyond, say, $n_{0}$ = 6 to 8, as mentioned above.
Thus I recommend $n_{0}=6$ as an all-round value. I see little benefit in developing adaptive algorithms that automatically tune $n_{0}$ , as I found little difference in the range $3\leq n_{0}\leq 20$ .
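
To put numbers on the worst-case bound mentioned above, here is a small Python sketch. It assumes that $n_{bisection}$ means the usual bisection count $\lceil \log _{2}((b-a)/(2\epsilon ))\rceil$, which this comment does not spell out, and the helper names are hypothetical. The bracket $[0,1]$ and $\epsilon$ = 1E-15 match the study described here; the listed $n_{0}$ values are just those mentioned on this page.

```python
import math


def bisection_iterations(a, b, eps):
    """Iterations plain bisection needs to shrink [a, b] to width <= 2*eps."""
    return max(0, math.ceil(math.log2((b - a) / (2.0 * eps))))


def worst_case_iterations(a, b, eps, n0):
    """Nominal ITP worst case: bisection count plus the slack n0."""
    return bisection_iterations(a, b, eps) + n0


for n0 in (0, 1, 6, 14):
    print(f"n0={n0:2d} worst case={worst_case_iterations(0.0, 1.0, 1e-15, n0)}")
```

The bound grows by only one iteration per unit of $n_{0}$, which fits the observation that a larger $n_{0}$ costs little in the worst case while reducing the mean.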

Tuning of k1

I tried scaling factors in the range $0.01\leq f\leq 1.00$ in increments of 0.01, in calculating $k_{1}=f/(x_{b}-x_{a})$.
The arithmetic mean number of iterations decreases exponentially as $f$ increases from zero towards approx. 0.33, above which the number of iterations flattens out in the region $0.33<f\leq 0.58$, so the optimal value of $f$ may be anywhere in this region. Above 0.58, the number of iterations increases again, but only slightly.
The median number of iterations is minimum in the range $0.08\leq f\leq 0.33$, and the third quartile is minimum in the range $0.30\leq f\leq 0.58$.
Thus I recommend $k_{1}=0.33/(x_{b}-x_{a})$ as an all-round value together with $n_{0}=14$ and epsilon = 1E-15, which worked well for the large range of monotonic function shapes that I tested. This finding is obviously influenced by the distribution of monotonic function shapes that I tried.

Comment written by Peter.schild (talk) 15:28, 2 January 2023 (UTC)