Assume I have the following matrix (defined here in Julia language):
mat = [1 1 0 0 0 ; 1 1 0 0 0 ; 0 0 0 0 1 ; 0 0 0 1 1]
Considering as a "component" a group of neighbour elements that have value '1', how to identify that this matrix has 2 components and which vertices compose each one?
For the matrix mat above I would like to find the following result:
Component 1 is composed by the following elements of the matrix (row,column):
(1,1)
(1,2)
(2,1)
(2,2)
Component 2 is composed by the following elements:
(3,5)
(4,4)
(4,5)
I can use Graph algorithms like this to identify components in square matrices. However such algorithms can not be used for non-square matrices like the one I present here.
Any idea will be much appreciated.
I am open if your suggestion involves the use of a Python library + PyCall. Although I would prefer to use a pure Julia solution.
Regards
Just got an answer from julia-users mailing list that solves this problem using Images.jl, a library to work with images in Julia.
They developed a function called "label_components" to identify connected components in matrices.
Then I use a customized function called "findMat" to get the indices of such matrix of components for each component.
The answer, in Julia language:
The answer is pretty simple (though i can't provide python code):
In pseudocode (using BFS):
Using
Image.jl
'slabel_components
is indeed the easiest way to solve the core problem. However, your loop over1:maximum(labels)
may not be efficient: it'sO(N*n)
, whereN
is the number of elements inlabels
andn
the maximum, because you visit each element oflabels
n
times.You'd be much better off just visiting each element of
labels
just twice: once to determine the maximum, and once to assign each non-zero element to its proper group:Output on your test matrix:
Calling library functions like
find
can occasionally be useful, but it's also a habit from slower languages that's worth leaving behind. In julia, you can write your own loops and they will be fast; better yet, often the resulting algorithm is much easier to understand.collect(zip(ind2sub(size(mat),find( x -> x == value, mat))...))
does not exactly roll off the tongue.