Bulgarian Legal Roberta Embeddings

Description

Pretrained Legal Roberta Embeddings model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP. legal-bulgarian-roberta-base is a Bulgarian model originally trained by joelito.

Download Copy S3 URI

How to use

sentence_embeddings = RoBertaEmbeddings.pretrained("roberta_base_bulgarian_legal", "bul")\
  .setInputCols(["sentence"])\
  .setOutputCol("embeddings")

val sentence_embeddings = RoBertaEmbeddings.pretrained("roberta_base_bulgarian_legal", "bul")
  .setInputCols("sentence")
  .setOutputCol("embeddings"))

Model Information

Model Name:	roberta_base_bulgarian_legal
Compatibility:	Spark NLP 4.2.4+
License:	Open Source
Edition:	Official
Input Labels:	[sentence, token]
Output Labels:	[embeddings]
Language:	bul
Size:	416.1 MB
Case sensitive:	true

References

https://huggingface.co/joelito/legal-bulgarian-roberta-base

PREVIOUSZero-Shot Named Entity Recognition (Generic sample)

NEXTCroatian Legal Roberta Embeddings