Department Mission: We are seeking an enthusiastic and motivated intern to
join our Language Modeling team and contribute to the creation of training data
and optimization of the model evaluation process. As an intern, you will have
the opportunity to work closely with our experienced professionals and gain
valuable hands-on experience in the field of AI and large language models.
Your Responsibilities:
* Create and annotate training data for language models. Collect and
preprocess raw text data from various sources. Create high-quality, original
prompt/response pairs.
* Assist in the development and maintenance of data pipelines. Collaborate
with data engineers and machine learning engineers to design and implement data
processing workflows.
* Optimize model evaluation procedures. Develop and implement evaluation
metrics and benchmarks for language models. Analyze model performance and
identify areas for improvement.
* Participate in brainstorming sessions and propose new ideas for model
optimization and evaluation. Support research initiatives by conducting
literature reviews and collecting and analyzing data.
* Keep detailed records of data processing procedures, annotation guidelines,
and evaluation results. Document findings, insights, and recommendations in
collaboration with the team.
Required Qualifications:
* Fluent English
* Bachelor's degree level or above
* Currently enrolled in an undergraduate or graduate program with a focus on
computer science, linguistics, or a related field
* Strong interest in natural language processing, machine learning, and
artificial intelligence
* Proficient in Python and familiar with common NLP libraries and frameworks
(e.g., NLTK, spaCy, TensorFlow, PyTorch)
* Knowledge of machine learning evaluation metrics and statistical analysis
What We Can Offer:
* Remuneration: RMB 170/day for Bachelor's students, RMB 200/day for Master's
students (gross).
* Housing allowance of up to RMB 2500 for students who are not Beijing
residents and are not studying in Beijing.
* Internship certificate