Description
Pretrained Legal Roberta Embeddings model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP. legal-spanish-roberta-base is a Spanish model originally trained by joelito.
How to use
sentence_embeddings = RoBertaEmbeddings.pretrained("roberta_base_spanish_legal", "es")\
.setInputCols(["sentence"])\
.setOutputCol("embeddings")
val sentence_embeddings = RoBertaEmbeddings.pretrained("roberta_base_spanish_legal", "es")
.setInputCols("sentence")
.setOutputCol("embeddings"))
Model Information
| Model Name: | roberta_base_spanish_legal |
| Compatibility: | Spark NLP 4.2.4+ |
| License: | Open Source |
| Edition: | Official |
| Input Labels: | [sentence, token] |
| Output Labels: | [embeddings] |
| Language: | es |
| Size: | 416.2 MB |
| Case sensitive: | true |
References
https://huggingface.co/joelito/legal-spanish-roberta-base