
Add ability to extract encoder embedding. #1604

Open
wants to merge 4 commits into master

Conversation

colinator

Adds the ability to get the encoder output. No other top-level methods expose ggml_tensors though - I'm not sure that's cool. It seemed the quickest way. What I'm doing, eventually, is this:

ggml_tensor * tensor = whisper_get_encoder_embedding(ctx);
std::vector<float> tensor_data(ggml_nelements(tensor));
// copy the encoder output from backend memory into a host-side buffer
ggml_backend_tensor_get(tensor, tensor_data.data(), 0, ggml_nbytes(tensor));

... so maybe it should expose a method that returns or populates a vector, instead of returning a ggml_tensor? What do you think?
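For context, a minimal sketch of how the tensor-returning variant can be wrapped into a vector-populating helper on the caller side (the wrapper name is hypothetical and not part of this PR's diff):

#include <vector>

#include "ggml.h"
#include "ggml-backend.h"
#include "whisper.h"

// Hypothetical helper: copy the encoder output (as exposed by this PR) out of
// backend memory (CPU, CUDA, Metal, ...) into a host-side std::vector<float>.
static std::vector<float> get_encoder_embedding_vec(struct whisper_context * ctx) {
    ggml_tensor * tensor = whisper_get_encoder_embedding(ctx); // accessor proposed in this PR
    std::vector<float> data(ggml_nelements(tensor));
    ggml_backend_tensor_get(tensor, data.data(), 0, ggml_nbytes(tensor));
    return data;
}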

Added methods to get encoder embedding
Added implementations of methods to get encoder embedding.
Collaborator

@bobqianic bobqianic left a comment


Style

Owner

@ggerganov ggerganov left a comment


so maybe it should expose a method that returns or populates a vector, instead of returning a ggml_tensor?

Yes, populating a buffer with the data would be more portable compared to returning ggml_tensor:

WHISPER_API void whisper_get_encoder_embedding(float * buffer);

@bobqianic
Collaborator

so maybe it should expose a method that returns or populates a vector, instead of returning a ggml_tensor?

Yes, populating a buffer with the data would be more portable compared to returning ggml_tensor:

WHISPER_API void whisper_get_encoder_embedding(float * buffer);

This leads to the question: how can the user determine the size of the buffer? In this scenario, we end up with only a float pointer, which points to the buffer we've just filled in whisper_get_encoder_embedding.

@ggerganov
Owner

The buffer size can be determined using the model parameters:

whisper.cpp/whisper.h, lines 347 to 358 in 2623640:

WHISPER_API int whisper_model_n_vocab (struct whisper_context * ctx);
WHISPER_API int whisper_model_n_audio_ctx (struct whisper_context * ctx);
WHISPER_API int whisper_model_n_audio_state(struct whisper_context * ctx);
WHISPER_API int whisper_model_n_audio_head (struct whisper_context * ctx);
WHISPER_API int whisper_model_n_audio_layer(struct whisper_context * ctx);
WHISPER_API int whisper_model_n_text_ctx (struct whisper_context * ctx);
WHISPER_API int whisper_model_n_text_state (struct whisper_context * ctx);
WHISPER_API int whisper_model_n_text_head (struct whisper_context * ctx);
WHISPER_API int whisper_model_n_text_layer (struct whisper_context * ctx);
WHISPER_API int whisper_model_n_mels (struct whisper_context * ctx);
WHISPER_API int whisper_model_ftype (struct whisper_context * ctx);
WHISPER_API int whisper_model_type (struct whisper_context * ctx);
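Putting the two suggestions together, a rough sketch of how a caller could size and fill the buffer. This assumes the encoder output holds n_audio_ctx * n_audio_state floats and that the proposed function takes a whisper_context argument; both are assumptions, since the final signature has not been settled in this discussion:

#include <vector>

#include "whisper.h"

// Hypothetical usage of the proposed buffer-filling API.
static std::vector<float> get_encoder_embedding(struct whisper_context * ctx) {
    const size_t n = (size_t) whisper_model_n_audio_ctx(ctx)
                   * (size_t) whisper_model_n_audio_state(ctx); // assumed element count
    std::vector<float> buffer(n);
    whisper_get_encoder_embedding(ctx, buffer.data()); // assumed signature
    return buffer;
}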
