Partnership & Collaboration

Collaboration with LatamGPT CENIA Project

Strategic partnership with CENIA's LatamGPT initiative to advance Latin American language models and foster regional AI research collaboration.

LatamGPTLLMsAI
English | Español

Key Highlights

Overview

We are excited to announce our collaboration with the LatamGPT project, led by CENIA (Centro Nacional de Inteligencia Artificial) in Chile. This partnership aims to strengthen Latin American AI research and contribute to the development of language models specifically designed for Spanish and Portuguese speakers across the region.

LatamGPT represents a critical effort to address the linguistic and cultural nuances of Latin America that are often underrepresented in global AI systems. Through this collaboration, SIMG brings expertise in efficient model training (QLoRA, LoRA) and domain-specific fine-tuning to support this important regional initiative.

Partnership Objectives

Our collaboration with CENIA and the LatamGPT project focuses on several key areas:

1. Knowledge Exchange

2. Model Development

3. Community Building

Initial Meeting Highlights

Our first official meeting with CENIA researchers was held in October 2024, where we discussed:

Discussion Points

Key Takeaways

Why This Partnership Matters

Regional Impact

Latin America has unique linguistic characteristics that differ from European Spanish:

Academic Collaboration

This partnership strengthens the Latin American AI research ecosystem:

Our Contributions

Technical Expertise

SIMG brings several key capabilities to the partnership:

1. Efficient Training Methods

2. Domain Specialization

3. Open-Source Philosophy

Planned Activities

Short-term (2024-2025)

Medium-term (2025-2026)

Long-term Vision

How to Get Involved

This is an open collaboration, and we welcome participation from:

Researchers

Students

Institutions

Technical Alignment

Infrastructure Compatibility

Both SIMG and LatamGPT share similar constraints and priorities:

# Example: Efficient fine-tuning setup compatible with both teams
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

# Load base LatamGPT model (when available)
model = AutoModelForCausalLM.from_pretrained(
    "cenia/latamgpt-base",
    load_in_4bit=True,  # Efficient memory usage
    device_map="auto"
)

# Apply QLoRA for domain adaptation
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "v_proj"],
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM"
)

model = get_peft_model(model, lora_config)
print(f"Trainable parameters: {model.num_parameters(only_trainable=True):,}")

Shared Goals

Impact Metrics

We will track the success of this collaboration through:

Research Outputs

Community Engagement

Model Performance

Contact & Updates

Want to learn more or participate in this collaboration?


Stay tuned for updates on this exciting partnership as we work together to advance Latin American AI research and create language models that truly represent our region!

Resources

Team & Collaborators

Researchers

  • SIMG Research Group

Collaborators

Supported by