About Me

I am currently an MSc. student at EPFL in Data Science and a Research Student Assistant at NLP lab supervised by prof. Antoine Bosselut.

Previously, I have been working in the DHLAB as a Research Student Assistant supervised by Frédéric Kaplan. In the past, I completed my BSc. in Computer Engineering at Politecnico di Torino and have done a 6 month internship as a Data Analyst at Fater supervised by Salvatore Croce.

Talk to virtual me


Education

  1. Master’s in Data Science. EPFL. Lausanne, Switzerland. Sep. 2023 - Present
  2. Bachelor’s in Computer Engineering. Politecnico di Torino. Turin, Italy. Sep. 2019 - Jul. 2023

Experience

  1. Student Research Assistant. NLP lab. EPFL, Switzerland. Jun. 2024 - Present.
    Multilingual Model Training
  2. Student Research Assistant. DHLAB. EPFL, Switzerland. Feb. 2024 - Sep. 2024.
    Text-to-SQL system and LLM agents development.
  3. Data Analyst. Fater. Italy. Nov. 2022 - May. 2023.
  4. IoT Engineer. LINKS. Italy. Oct. 2021 - Feb. 2022.

Teaching Experience

  1. Student Teaching Assistant. Applied Data Analysis [CS-433]. EPFL. Fall 2024.

Projects

  1. LLM Training with SFT, DPO, and RAG. Instruction-Tuned Galactica-1.3B model on Scientific MCQA task using SFT and DPO, and further tuned in the RAG settings. [report]
    #DeepLearning #MachineLearning #LLM #FineTuning #RAG #NLP
  2. Coin Detection and Classification. A computer vision project where we implemented the segmentation and classification of the coins in the given images. [code, slides]
    #ComputerVision #DeepLearning #MachineLearning #CNN
  3. Reinforcement Learning on Mountain Car Environment. For the Mountain Car environment, we implemented Dyna, DQN algorithms, and the extensions of DQN with heuristic rewards. [code, report]
    #ReinforcementLearning #DeepLearning #MachineLearning
  4. YouTube Analysis. Causal Analysis of Tech channels’ progress on YouTube using the videos published between May 2005 and October 2019. [code, datastory, blog]
    #DataAnalysis #CausalAnalysis #MachineLearning #DataVisualization
  5. LLM Fine-Tuning. Fine-tuned 3 LLMs (Mistral-7B, Llama-2-7B, Phi-1.5) on a dataset from X for the stance detection task. [report, blog]
    #DeepLearning #MachineLearning #LLM #FineTuning #NLP
  6. Cardiovascular Diseases Prediction. Implemented standard ML algorithms using native python libraries and numpy for Classification taks. [code, report]
    #MachineLearning #Classification

Mini Projects (less than a week)

  1. Gender Classification. [code]
    #DeepLearning #Classification
  2. Pneumonia Prediction. [code]
    #DeepLearning #Classification
  3. Customer Satisfaction Prediction. [code]
    #MachineLearning #Classification
  4. Ticket Price Prediction. [code]
    #MachineLearning #Regression

Relevant Courseworks

  1. CS-612. Topics in NLP. [webpage]
  2. CS-552. Modern NLP. [webpage]
  3. CS-456. Artificial Neural Networks and Reinforcement Learning. [github, webpage]
  4. CS-433. Machine Learning. [github, webpage]
  5. CS-401. Applied Data Analysis. [github, webpage]
  6. EE-451. Image Analysis and Pattern Recognition. [webpage]
  7. EE-556. Mathematics of data: from theory to computation. [webpage]
  8. CS-423. Distributed Information Systems. [webpage]

External Activities

  1. Jon and John, YouTube. A YouTube channel about Italian Culture, and studying in Italy. Now I mostly do interviews with the people in AI.
  2. Jon and John, Telegram. An Uzbek community I co-founded back in Italy to bring together Uzbek Students in Italy.
  3. Student Help. An education consultancy service I started with friend. Now run by Muslimjon Nabijonov.
  4. Jakhongir Saydaliev, Telegram. A personal Telegram channel where I share my thoughts, experiences and knowledge (in Uzbek).

News

Nov. 2024 Won the AXA challenge in Lauzhack 2024 Hackathon [project]
Sep. 2024 Participated in DeepFake Hackaton
Jun. 2024 Started Summer Research Internship in NLP Lab on Multilingual Model Training project by Swiss AI Initiative
May. 2024 Presented my LLM Agents project in DHLAB
Apr. 2024 Participated in LLM Hackathon
Apr. 2024 Presented my text-to-SQL project in DHLAB
Mar. 2024 Participated in AMLD 2024
Feb. 2024 Started Student Research Assistantship in DHLAB on LLM QA system project by Venice Time Machine