input embeddings, output similarity. Can this be done through an inference endpoint?

#38

by click37 - opened Nov 22, 2023

Nov 22, 2023

Hi there

I am computing the embeddings of sentences using the model. I store them so that I don't run this computation in real time. I then get a new sentence in real-time, compute the embedding and want to run a sentence similarity between this new embedding and all others I did prior.

Can this be done here?

sharad

Dec 2, 2023

You can use any open source vector db(chroma, weaviate) to retrieve best matches of semantic similar sentences. I hope that's what you're looking for.

wmedrano

May 27, 2024

I'm also looking to do something similar with the inference API. However, the API seems to only support getting sentence similarity and not the actual vectors.

Supported

Inputting a source_sentence.
Inputting several other_sentences.
Getting the similarity of other_sentences with source_sentence.

Not Supported (Or maybe not well documented)

Inputting sentences or sentence.
Getting the 384 dimensional vectors.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment