Creating Workflows in Clarifai Community

New

WEBINAR | An AI Operating Model for Content Organization and Personalization

Join the Discord

Listen to our recent tutorial on creating custom workflows in Clarifai Community

This post is a transcription of our recent tutorial video for people who just don't enjoy sitting through videos :)

Workflow: OCR English-to-Spanish

One of the greatest features of Clarifai Community is the ability to use models as building blocks. I'm going to demonstrate here how I can create AI workflows by connecting 2 models together to create a multimodal system, that is, one that combines different types of media like images with text.

wfblog-1-jpg

First, I create the application. This is a container for all our related models and workflows for this particular project. I give it a unique name, a short description, the language, and a default workflow.

wfblog-2-jpg

Boom, application created.

wfblog-3-jpg

This is what the app looks like when it's empty!

wfblog-4-jpg

To create a new workflow, go to workflows, then create workflow.

wfblog-5-jpg

Here we see the no code, drag and drop interface to connect models together! Let's search for an optical character recognizer, which is how computers can extract text from an image, whether it be a scan of a printed page or a photo of street signs.

wfblog-6-jpg

Now we search for a second model, a text-to-text model, which transforms one kind of text into another. Now we draw in the connections that show the flow of information from one model to another.

wfblog-7-jpg

Next we'll specify which model we'll use for optical character recognition, the PaddleOCR model.

wfblog-8-jpg

Now we choose the text-to-text model by searching for the word Spanish, and use an English to Spanish translation model. And we're done!

wfblog-9-jpg

Let's save this workflow and grab a few examples, as I've saved URLs of some demo images that we can try the workflow out with.

wfblog-10-jpg

The models take a moment to run the first time, but after that they're in memory and run quicker. As you can see, it's correctly read the text on the sign in the image, and translated it into Spanish! iTambién podemos vender el tuyo!

wfblog-11-jpg

Taking a look at other images, it's identified all the text and translated it, including a graphic with text, text printed on hanging flags, and a bilingual poster where it left the Spanish text unchanged.

wfblog-12-jpg

We can even save our workflow, and use it in another application.

wfblog-13-jpg

Workflow: ASR Sentiment Analysis

Another awesome multimodal workflow we can create is converting speech to text, then analyzing if contains positive or negative sentiment. Once again, I'll use the same app we just created to add the workflow. I'm renaming it to ASR sentiment, which is short for automatic speech recognition sentiment analysis. I grab an audio-to-text model, connect the wiring, and then drag in and connect a text classifier.

wfblog-15-jpg

wfblog-16-jpg

I then select the first model in the chain and search for an English audio-to-text model, then pick the Wav2vec model and choose the most recent version.

wfblog-17-jpg

Then I click the text classifier model, search for sentiment, and open up the full choices from community to choose the sentiment-analysis Distilbert model, and again select the most recent version.

wfblog-18-jpg

I save it, and again I'll bring in some examples I'd previously set up to test this workflow.

wfblog-19-jpg

Let's take a listen to the first one. A good morning and a great presentation is definitely a positive statement, and the prediction shows that it is!

wfblog-20-jpg

And this concludes this tutorial! Two multimodal workflows created in just a few minutes, and that's the power of Clarifai Community.

Explore Clarifai Community!

Explore 1000s of pre-trained models from Clarifai and other leading AI builders. Build, share and reuse models and workflows with other community members.