It will be a long question, please, take a deep breath before reading.
I want to understand what would be the fastest algorithm to convert index of one dimensional array to a vector index of a multidimensional array.
Let's proceed with an example to understand why do I need it:
I have a 2 dimensional array: Array[i1][i2]
i1 runs from i1_b=0 to i1_e=2
i2 runs from i2_b=0 to i2_e=1
So this array is outputted into the file line by line:
Array[0][0]
Array[0][1]
Array[0][2]
Array[1][0]
Array[1][1]
Array[1][2]
Now I read the file line by line and index k is the number of the line being read last.
I read the first line which is Array[0][0] and k=0
I read the second line which is Array[0][1] and k=1
...
One can notice that k will run from k_b=0 to k_e=5 and
k=0 will correspond to i1=0, i2=0
k=1 will correspond to i1=0, i2=1
...
Problem: So my problem is how to convert k into i1 and i2 the fastest way possible? (I don't need it while reading the file, but later in my program)
In this example, one of the solutions would be
i1=k/(i1_e - i1_b + 1);
i2=k%(i1_e - i1_b + 1);
Question 1: Is it the fastest possible solution in term of cycles and computer time?
OK. Question 2: How can we generalize this algorithm to multidimensional arrays?
Array[i1][i2][i3][i4]
i1=k/(i1_e - i1_b + 1);
i2=k%(i1_e - i1_b + 1);
i3=i2/(i1_e - i1_b + 1);
i4=i2%(i1_e - i1_b + 1);
Question 3: Is it the fastest way to do it?
Question 4: related question would be what is the latency for modular division, integer division, adding integers and multiplying integers? If these numbers depend on the architecture, please, also let me know.
Thanks in advance!
P.S. It may be easier for someone to think about this problem as the fastest algorithm to convert seconds into days-hours-minutes-seconds.
please find below how I would implement this in C++1x, I hope this can be useful. Cheers
If you have an array
arr[dim_1][dim_2]...[dim_n]
, you have the equationso
i_1 = k / (dim_2*..*dim_n)
andr_2 = k % (dim_2*...*dim_n)
, thenetc,
until
i_n = r_n
is found.If the dimensions are known at compile time, the divisions can be replaced by multiplications, shifts and additions/subtractions. On many architectures, that is faster than a division instruction. On others, not.
But it's only worthwhile thinking about if you're doing a lot of indexing in that array and not much else.
These numbers depend on the architecture and processor.