Hey, I'm Paul Guan

Graduate at Epita School of Computer Engineering in Paris @Paul Guan / pguan.pro@gmail.com

ABOUT ME

Junior engineer with a passion for data and AI (NLP, Machine Learning, Deep Learning), with a solid foundation in software and web development. Curious, proactive and versatile, I enjoy taking on challenges, learning continuously and actively contributing to innovative projects, both as part of a team and independently. My versatility and eagerness to learn are my greatest strengths, enabling me to adapt to any situation.

PROJECTS

Throughout my journey as an aspiring data scientist, I have worked on a wide array of projects that span various domains and technologies.

Each project has been a unique opportunity to apply my skills, solve complex problems, and innovate.

NLP/LLM - Copilosh / Copilosh WEB

December 2024

PYTHON
REACT
DOCKER
AZUR

Copilo.sh is a wrapper function to add to your .bashrc or .zshrc file. Accompanied by a FastAPI local server, running a LM on CPU. It will catch all the errors you make in your terminal (non-zero exit code) and call the LM to generate a response to help you solve the error.

  • Performance: Good but depends on the choice of the model
  • Portability : Running with CPU and could be deployed in a Web Server
  • Deployment : Github CI/CD + Microsoft Azur VM + Canary Deployment with Docker

Video of my project

Data Engineering - TermiCator

June 2024

SCALA
AWS

This project, developed in Scala, aims to alert a large number of users efficiently and effectively. Key features and technologies used in this project include:

  • Project Scalability : Designed to handle a high volume of alerts and user notifications.
  • IoT Device Simulation : Simulated IoT devices triggering alerts based on specific criteria, ensuring relevant notifications are sent to the concerned users.
  • Technologies : Utilized AWS Kinesis, Firehose, S3, DynamoDB, and EMR for data streaming, storage, and processing.
  • Infrastructure as Code : Managed and provisioned resources using Terraform.

Some images of the project

Microsoft Azur - Weather

June 2024

AZUR
PYTHON

This project focuses on predicting weather conditions using various Microsoft Azure services. Key components and technologies used in this project include:

  • Custom Vision : Leveraged Azure Custom Vision for image recognition and analysis related to weather patterns.
  • Machine Learning : Implemented machine learning models to predict weather conditions based on historical data and patterns.
  • Azure Web App Service : Deployed the application using Azure Web App Service for scalability and ease of access.

Some images of the project

BigData - Displaying Stock Prices

April 2024 - May 2024

PYTHON

This project, developed in Python, aims to display stock prices for various companies. Key features and technologies used in this project include:

  • Data Handling : Reading, cleaning, and storing large volumes of stock market data.
  • Libraries Used : Utilized numpy and pandas for data manipulation and matplotlib for data visualization.
  • Data Visualization : Displayed company stock prices using Dash for interactive and dynamic web applications.

Some images of the project

CNN - Image Classification

May 2024

PYTHON

This project, developed in Python using TensorFlow, aims to classify images of ships into 10 different categories. Key aspects of the project include:

  • Goal : Building a convolutional neural network (CNN) to classify images of 10 types of ships.
  • Technology : Developed the CNN using TensorFlow and Keras for efficient and accurate image classification.
  • Model Training : Trained the models and evaluated their performance using relevant metrics.

Some images of the project

Multi-Agent - Simulating behavior in the metro

March 2024 - May 2024

NETLOGO

This project aims to simulate people's behavior and study their reactions in the metro

  • Technology : Developed in NetLOGO.

NLP - Restaurant Review Classification

April 2024 - May 2024

PYTHON

This project, developed in Python, aims to classify restaurant reviews. Key aspects of the project include:

  • Dataset : Used the Semeval-ABSA dataset from Hugging Face.
  • Data Processing : Tokenization using NLTK and regex, byte pair encoding, text normalization (stop words removal, lemmatization, conversion to lowercase).
  • Data Statistics : Conducted comprehensive statistical analysis of the dataset.
  • Model Training : Trained multiple predictive models including RNN and Naive Bayes.

Some images of the project

Hackathon - Ministry of Armed Forces

April 2024

PYTHON

Python project developped in 3 days that allows you to identify if a text is generated by an AI or human. Key aspects of the project include:

  • Data : Training data with 2 labels (AI/Human)
  • Algorithms used : Naives Bayesian/CNN/Logistic Regression

LinkedIn Posts

Sudoku Solver

March 2024 - April 2024

C#

This project involves solving Sudoku puzzles using C#. Key aspects of the project include:

  • Technology : Developed in C#.
  • Algorithm : Implemented the Dancing Links method to efficiently solve Sudoku puzzles.

IDE Shrek

June 2023 - July 2024

JAVA
REACT
JAVASCRIPT
DOCKER

This project involved developing an Integrated Development Environment (IDE) themed around Shrek. Key features and technologies used in this project include:

  • Python Execution : Ability to run Python code.
  • Frontend : Developed using JavaScript and React.
  • Backend : Implemented in Java.
  • Basic IDE Features : Create, open, delete, and move files.
  • Advanced Features : Integrated Git and Maven functionalities.
  • Deployment : Delivered as a Docker container for easy deployment and scalability.

Some images of the project

Tiger Compiler

March 2023 - May 2023

C++

This group project, developed over the course of three months, aimed to create a basic compiler for the Tiger programming language using C++. Key aspects of the project include:

  • Technology : Developed using C++.
  • Group Collaboration : Successfully completed through effective teamwork and collaboration.
  • Duration : Spanned three months of intensive development and testing.
  • More informations about the project : https://assignments.lrde.epita.fr/

42SH

January 2023 - February 2023

C

This group project, developed over the course of four weeks, involved implementing a basic Unix shell command interpreter using the C programming language. Key aspects of the project include:

  • Duration : Completed in four weeks.
  • Technology : Developed using the C programming language.
  • Functionality : Implementation of a basic Unix shell command interpreter.

Genetic Pipeline with Institut Curie

January 2022 - May 2022

PYTHON

This project, conducted in collaboration with the Institut Curie, focuses on recognizing cancer-causing variants using advanced computational techniques. Key aspects of the project include:

  • Goal : Identification of cancer-causing genetic variants.
  • Pipeline Development : Created a data processing pipeline using Snakemake.
  • Neural Networks : Implemented neural networks using Keras for variant recognition.
  • Bioinformatics Tools : Utilized bioinformatics software such as Varsim, Samtools, and IGV.
  • Data Handling : Reading and writing data produced by bioinformatics software.
  • Weekly Reports : Prepared and presented weekly progress reports.
  • Deliverables : Authored comprehensive project deliverables.

Robotics Project with LEGO MINDSTORM

January 2021 - May 2021

JAVA

This project involved designing and programming a LEGO MINDSTORM robot to collect balls, developed in Java. Key aspects of the project include:

  • Programming Language : Implemented in Java.
  • Team Collaboration : Executed as a group project.
  • Weekly Reports : Prepared and presented weekly progress reports.
  • Documentation : Authored various documents including project specifications, testing plans, and development plans.
  • Project Defense : Delivered a final presentation at the end of the project.

Video of my project

Professional Experiences

Sofware Data Engineer at SII / February 2025 - August 2025

  • Projects QT
  • C++
  • Development software using AI

Design and development of advanced software solutions for defense and radar systems

My responsibilities included :

  • Designed and developed mission-critical defense applications incorporating modern AI techniques into software development processes
  • Developed C++/QT applications with intelligent algorithm implementation to enhance radar system performance and accuracy
  • Optimized AI tooling by engineering advanced prompts for LLMs (particularly GitHub Copilot) to accelerate development cycles
  • Conducted applied research on AI effectiveness in software development, including comparative analysis of productivity gains and code quality improvements
  • Performed ongoing technology watch on industrial AI innovations, benchmarking best practices for integration into critical systems

Key Strengths:

  • Expertise in critical systems-oriented C++ (QT, multithreading, low-level programming)
  • Proficiency in AI-powered development workflows (Copilot, LLMs)
  • Data-driven scientific approach to process optimization

Full Stack Developer at Brainsonic / September 2023 - February 2024

  • Projects Symfony or WordPress
  • HTML/CSS/JS/PHP
  • Technologies Web 3

At Brainsonic, my role involved developing various web projects, including websites, web applications, and web games for a diverse range of clients.

My responsibilities included :

  • Development of innovative web functionalities : Designing and implementing new features for websites.
  • Collaboration and problem-solving : Actively participating in solving technical challenges in close collaboration with project teams.

Some images of the projects at Brainsonic

Curriculum Vitae

Contact

For further discussions or inquiries, please feel free to get in touch with me.