Skip to content
View KnightKotOR's full-sized avatar
๐Ÿ”ฎ
Take in the weight each moment bears
๐Ÿ”ฎ
Take in the weight each moment bears
  • Saint-Petersburg
  • 17:03 (UTC +03:00)

Block or report KnightKotOR

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please donโ€™t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
KnightKotOR/README.md

Hello there ๐Ÿ‘‹

I focus on the intersection of applied ML and ML Engineering โ€” from experiments to deployment.

Currently: polishing projects, diving deep into MLOps, and open to Data Scientist / ML Engineer positions.

Languages and Tools

ML / DL: Python PyTorch Scikit-learn XGBoost CatBoost Optuna SHAP

Data: SQL Pandas NumPy SciPy Matplotlib

Engineering: Git Docker GitHub Actions RabbitMQ uv

๐Ÿ“‚ My projects ๐Ÿ“‚:

  1. Music Genre Classifier:

    Group project โ€” ML lead contributor

    • Designed and trained a 2 MB CNN, achieving F1 macro = 0.82 without data leakage; accuracy is comparable to a 380 MB fine-tuned transformers
    • Adapted the model for inference in a container using Docker and RabbitMQ (The container was configured by the other member)
    • Team practices: PR reviews, conventional commits, feature-branch flow, CI checks on all merges
  2. Support ticket prioritization:

    End-to-end ML project โ€” classification on imbalanced data with business constraints

    • Built a priority classifier (Low / Medium / High) on a 50K synthetic dataset with severe class imbalance
    • Eliminated critical errors: zero cases of Highโ†’Low misclassification
    • Used TreeSHAP to verify model relies on real business factors (downtime, affected users)
  3. Ultrasonic welding optimization:

    R&D thesis project for the "Polymer Composite Materials" lab, SPbPU
    Bachelor's thesis โ€” defended with excellence 25 June 2025 ๐Ÿ”’ private โ€” refactoring in progress ๐Ÿ”’

    • Implemented a surrogate assisted bayesian optimization system based on CatBoost + Optuna
    • Achieved 1.4โ€“3ร— reduction in the number of experiments needed to find the global optimum compared to classical Bayesian optimization
    • Validated on benchmark functions (Branin, Rosenbrock, harmonic) with systematic methodology
    • Compared:
      • CV configurations (4-,6-,8-fold and LOO)
      • ML models for a surrogate (CatBoost, XGBoost, RandomForest, MLP with Nadaraya-Watson kernel)
      • Optimization methods (CMA-ES, TPE, GP, QMC)
  4. Topological Data Analysis research:

    Research group project: assisted with TDA pipeline; responsible for the classifier

    • Built a TDA pipeline extracting robust geometric features from 3D point clouds using gudhi and giotto-tda
    • Improved F1-macro from 0.73 to 0.81 (+10%) over the baseline CNN, demonstrating effectiveness of topological features

Pinned Loading

  1. yrmint/ml-app-arch yrmint/ml-app-arch Public

    Project that implements music genre classification.

    Jupyter Notebook 1

  2. topology-mnist3d topology-mnist3d Public

    Classification of 3D MNIST using TDA + xgboost

    Jupyter Notebook 1

  3. SupportTickets-Prioritization SupportTickets-Prioritization Public

    ML model for an automatic support tickets prioritization

    Jupyter Notebook 1