Inference Workflow in General

Important

This page is incomplete. Make sure to also follow the examples, such as About Single-Prompt Example, About Dialogue Example, and Embeddings Example.

The inference workflow is the same simple sequence for all model types, including those that are not implemented yet.

Here is the flow of operations the user should follow to achieve non-blocking inference: