Blog::Trapple

技術系のちょっとしたメモとか

What is Cloud Vision API? の文字起こし


What is Cloud Vision API?

英語の聴き取り/書き取りの練習にちょうど良さそうだったので。

Cloud Vision provides powerful Image Analytics capabilities as easy to use APIs.
It enables application developers to build the next generation of application that can see, and understand the content within the images.
The service is built on powerful computer vision models that power several different Google services.
The service enables developers to detect a broad set of entities within an image from everyday objects, faces, and product logos.
The service is so easy to use.
As one of example of the use cases.
You can have any Raspberry Pi robot like GoPiGo calling the Cloud Vision API directly.
So the bot can send images taken by its camera to the cloud and can get analysis result to me real time.
It detects faces in an image along with the associated emotions.
The Cloud Vision API is also able to detect entities within the image.
Now, let's see how facial detection works.
Cloud Vision detects faces on the picture and returns the positions of eyes, nose, and mouth.
So you can program the bot to follow the face.
It also detects emotions such as joy, anger, surprise, and sorrow.
So the bot can move forward smiling faces or avoid anger or surprise face.
One of the very interesting features of Cloud Vision API is the entity detection that means it detects any objects you like.
That see however ex.
"It's glasses."
"It's banana."
"It's automobile."
"It's money."
You see?
Cloud Vision lets developers take advantage of Google's latest machine learning technologies quite easily.
Please go to cloud.google.com/vision to learn more.

自分の英語レベルだと、初見で等速聴き取りは結構きつかったので、以前このブログで紹介したmimiCopyを使って60%くらいの速度にして聴き取りをしました。

trapple.hateblo.jp

それでもどうしても聴き取れない場合はHiNativeというサービスでネイティブに質問するのがとても有効でした。
https://hinative.com/en-US/questions/260544
https://hinative.com/en-US/questions/261258

ちなみにCould Vision APIはまだ試してません^^;