English Financial BERT Sentence Embeddings (Finance, 768D)

Description

Pretrained Financial BERT Sentence Embeddings model, adapted from Hugging Face and curated for scalability and production-readiness using Spark NLP. setfit-finetuned-financial-text-classification is an English financial model that maps sentences and paragraphs to a 768-dimensional dense vector space and can be used for tasks like clustering or semantic search.
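To illustrate what semantic search over such embeddings can look like, the sketch below ranks candidate sentences by cosine similarity to a query vector. It assumes the embeddings have already been extracted as plain Python lists of floats. The helper names (`cosine_similarity`, `rank_by_similarity`) and the 3-dimensional toy vectors are hypothetical stand-ins for real 768-dimensional outputs, and none of this is part of the Spark NLP API.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # convention chosen here: a zero vector matches nothing
    return dot / (norm_a * norm_b)

def rank_by_similarity(query, candidates):
    """Return (label, score) pairs sorted from most to least similar."""
    scored = [(label, cosine_similarity(query, vec))
              for label, vec in candidates.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)

# Toy 3-D vectors standing in for real 768-D sentence embeddings (made up).
candidates = {
    "earnings beat expectations": [0.9, 0.1, 0.0],
    "shares fell sharply": [0.0, 0.2, 0.9],
}
query = [1.0, 0.0, 0.0]
ranking = rank_by_similarity(query, candidates)
print(ranking[0][0])
```

Cosine similarity is only one common choice of metric; for clustering, the same vectors could instead be passed to any standard clustering routine.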


How to use

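from sparknlp.annotator import BertSentenceEmbeddings

# Load the pretrained model; it reads the "sentence" column and writes "bert_sentence".
sentence_embeddings = BertSentenceEmbeddings.pretrained("sbert_setfit_finetuned_financial_text_classification", "en")\
  .setInputCols(["sentence"])\
  .setOutputCol("bert_sentence")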

import com.johnsnowlabs.nlp.embeddings.BertSentenceEmbeddings

val sentence_embeddings = BertSentenceEmbeddings.pretrained("sbert_setfit_finetuned_financial_text_classification", "en")
  .setInputCols("sentence")
  .setOutputCol("bert_sentence")

import nlu
nlu.load("en.embed_sentence.bert.finetuned").predict("""Put your text here.""")

Model Information

Model Name:	sbert_setfit_finetuned_financial_text_classification
Compatibility:	Spark NLP 4.2.4+
License:	Open Source
Edition:	Official
Input Labels:	[sentence]
Output Labels:	[bert_sentence]
Language:	en
Size:	409.0 MB
Case sensitive:	true

References

https://huggingface.co/nickmuchi/setfit-finetuned-financial-text-classification
