Department Mission: We are seeking an enthusiastic and motivated intern to 
join our Language Modeling team and contribute to the creation of training data 
and optimization of the model evaluation process. As an intern, you will have 
the opportunity to work closely with our experienced professionals and gain 
valuable hands-on experience in the field of AI and Large language model.
Your Responsibilities:
 * Create and annotate training data for language models. Collect and 
preprocess raw text data from various sources. Create high quality and original 
prompt/response pairs.
 * Assist in the development and maintenance of data pipelines, collaborate 
with data engineers and machine learning engineers to design and implement data 
processing workflows.
 * Optimize model evaluation procedures, develop and implement evaluation 
metrics and benchmarks for language models, Analyze model performance and 
identify areas for improvement.
 * Participate in brainstorming sessions and propose new ideas for model 
optimization and evaluation, Support research initiatives by conducting 
literature reviews, collecting and analyzing data.
 * Keep detailed records of data processing procedures, annotation guidelines, 
and evaluation results. Document findings, insights, and recommendations in 
collaboration with the team
Required Qualification:
 * Fluent English
 * Above Bachelor Degree
 * Currently enrolled in an undergraduate or graduate program with a focus on 
computer science, linguistics, or a related field
 * Strong interest in natural language processing, machine learning, and 
artificial intelligence
 * Proficient in Python and familiar with common NLP libraries and frameworks 
(e.g., NLTK, spaCy, TensorFlow, PyTorch)
 * Knowledge of machine learning evaluation metrics and statistical analysis
What We Can Offer:
 * Remuneration 170 RMB/day for Bachelor,200RMB/day for Master students gross.
 * Housing allowance up to RMB 2500 for students that are not Beijing 
residents and are not studying in Beijing.
 * Internship certificate