Update README.md
README.md CHANGED
@@ -7,7 +7,7 @@ license: openrail
LegalBert-pt is a language model for the legal domain in the Portuguese language. The model was pre-trained to acquire domain specialization, and it can later be fine-tuned for specific tasks. Two versions of the model were created: one as a complement to the BERTimbau model, and the other from scratch. The effectiveness of the BERTimbau-based model was evident when analyzing the models' perplexity. Experiments were also carried out on the tasks of identifying legal entities and classifying legal petitions. The results show that the domain-specific language models outperform the generic language model on all tasks, suggesting that specializing the language model for the legal domain is an important factor in improving the accuracy of learning algorithms.
- Keywords: Language model, Legal Bert pt
+ Keywords: Language model, Legal Bert pt br, Legal domain, Portuguese Language Model
## Available models
|Model|Initial model|#Layers|#Params|
@@ -44,4 +44,22 @@ from transformers import AutoModel # or BertModel, for BERT without pretraining
model = AutoModelForPreTraining.from_pretrained('raquelsilveira/legalbertpt_fp')
tokenizer = AutoTokenizer.from_pretrained('raquelsilveira/legalbertpt_fp')
```
+
+ ## Cite as
+
+ @inproceedings{10.1007/978-3-031-45392-2_18,
+   author = {Silveira, Raquel and Ponte, Caio and Almeida, Vitor and Pinheiro, Vl\'{a}dia and Furtado, Vasco},
+   title = {LegalBert-pt: A Pretrained Language Model for the Brazilian Portuguese Legal Domain},
+   year = {2023},
+   isbn = {978-3-031-45391-5},
+   publisher = {Springer-Verlag},
+   address = {Berlin, Heidelberg},
+   url = {https://doi.org/10.1007/978-3-031-45392-2_18},
+   doi = {10.1007/978-3-031-45392-2_18},
+   booktitle = {Intelligent Systems: 12th Brazilian Conference, BRACIS 2023, Belo Horizonte, Brazil, September 25–29, 2023, Proceedings, Part III},
+   pages = {268–282},
+   numpages = {15},
+   keywords = {BERTimbau, BERT, Legal Texts, Language Models},
+   location = {Belo Horizonte, Brazil}
+ }
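For quick reference outside the diff: a minimal usage sketch of the checkpoint loaded above, assuming the published weights include BERT's masked-language-modeling head. The example sentence and `top_k` value are illustrative, not from the model card:

```python
from transformers import pipeline

# Build a fill-mask pipeline directly from the checkpoint name;
# the pipeline resolves the tokenizer and MLM head from the repo config.
fill_mask = pipeline("fill-mask", model="raquelsilveira/legalbertpt_fp")

# Illustrative Portuguese legal sentence with a masked token.
for pred in fill_mask("O réu foi condenado ao pagamento de [MASK].", top_k=5):
    print(pred["token_str"], round(pred["score"], 4))
```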
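The abstract above says the pre-trained model can later be adjusted for specific tasks such as legal petition classification. A minimal fine-tuning sketch with the Transformers `Trainer` is shown below; the toy dataset, label count, and hyperparameters are placeholders, not the paper's experimental setup:

```python
from datasets import Dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

# Placeholder data: two toy petition snippets with made-up labels.
# The paper's experiments use a real petition corpus instead.
data = Dataset.from_dict({
    "text": ["Requer a concessão de liminar.", "Solicita a penhora de bens."],
    "label": [0, 1],
})

tokenizer = AutoTokenizer.from_pretrained("raquelsilveira/legalbertpt_fp")
# num_labels=2 is a placeholder; set it to the number of petition classes.
model = AutoModelForSequenceClassification.from_pretrained(
    "raquelsilveira/legalbertpt_fp", num_labels=2)

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True,
                     padding="max_length", max_length=128)

data = data.map(tokenize, batched=True)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="legalbertpt-petitions",
                           num_train_epochs=1,
                           per_device_train_batch_size=2),
    train_dataset=data,
)
trainer.train()
```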