How can this be completed with the Google Vision-API please?
- send image to vision-api
- request: 'features': [{': 'LABEL_DETECTION','maxResults': 10,}]
- receive the labels in particular the one I'm interest in is a "clock"
- receive the boundingPoly so that I know the exact location of the clock within the image
- having received the boundingPoly I would want to use it to create a dynamic AR marker to be tracked by the AR library
Currently it doesn't look like Google Vision-API supports a boudingPoly for LABELS hence the question if there is a way to solve it with the Vision-API.
Currently Label Detection does not provide this functionality. We are always looking at ways to enhance the API
After two years, its the same. I am facing similar challenges and I am thinking of opting other solutions. I think custom solutions like
TensorFlow object detection API
orDarkNet YOLO object API
will do this job very easily.