Arjun Parthasarathy

MSc Data Science for Decision Making student at Maastricht University researching AI in healthcare, and digital privacy.


At the moment, work is split across three problems with almost no methodological overlap: characterising eigenvalue regions for doubly stochastic matrices (an open problem in pure mathematics since the 1960s), analysing the legal architecture of EU Chat Control legislation, and building clinical risk models for CML patients that can explain their outputs to treating physicians. The MSc at Maastricht is in Data Science for Decision Making — a programme that treats quantitative methods and real-world decision contexts as the same subject — which accounts for some of the range.

Client-Side Content Scanning & Regulatory Framework Analysis

Client-Side Scanning (CSS), as proposed under the EU Chat Control regulation, would require message content to be scanned on a user's device before encryption — placing the detection mechanism outside the encrypted channel. Technical analysis focused on two failure modes: high false positive rates in content matching, and the feasibility of deploying hidden dual-purpose models capable of facial recognition. The critical finding was structural: the legislation delegates scanning authority to private entities, which makes mass surveillance architecturally possible independent of stated legislative intent. This constitutes a violation of Articles 7 and 8 of the EU Charter of Fundamental Rights, which guarantee privacy and data protection.

Nonlinear Inequality System for Doubly Stochastic Matrices

Doubly stochastic matrices have known eigenvalue characterisations in low dimensions, but for n ≥ 6 the question of which eigenvalue regions are achievable remains open — a problem unresolved for roughly sixty years. The work investigated whether a nonlinear inequality system could characterise those regions. Because no analytical solution exists and there is no ground truth to validate against, the approach relied on heuristic methods: simulated annealing, neural networks, interval arithmetic, and grid search. The result was a ruling-out. The solution space grows in complexity with dimension in a way that makes generalisations from smaller cases infeasible.

Cardiovascular Disease Prediction with Explainable AI

Cardiovascular disease is a secondary risk for patients with Chronic Myeloid Leukaemia (CML), and predicting that risk requires working with data where individual predictions carry direct clinical consequences. A random forest model was trained for risk stratification, then integrated with SHAP for global feature attribution, LIME for local approximation, and DiCE for counterfactual generation. Explainability matters here for a specific reason: a clinician cannot act on a score they cannot interrogate. A knowledge graph built from clinical guidelines added a layer of structured reasoning to support treatment suggestions alongside the model's output.

Publication

Early Diagnosis of Glaucoma and Diabetic Retinopathy Using Fundus Images Based on Ensemble Approach

An ensemble classifier combining XGBoost and LightGBM was trained to detect glaucoma and diabetic retinopathy from fundus images. The model reached 99% accuracy on the dataset, outperforming CNN and SVM baselines on the same evaluation. Published in Springer Proceedings.

DOI: 10.1007/978-3-031-82389-3_19

CareData Infomatics

Intern — Chennai,

COVID-19 patient records were analysed across variables including age, vaccination status, and comorbidities to identify which factors predicted higher-risk outcomes. K-means clustering produced risk-level segments; logistic regression was then applied to assign probabilistic scores to new records. The clearest result from the data was the predictive weight of cardiovascular disease: patients with pre-existing cardiovascular conditions were consistently classified at elevated risk across both methods.

Maastricht University

MSc Data Science for Decision Making, –present

SRM Institute of Science and Technology

BTech Computer Science and Engineering,

Specialisation: Big Data Analytics

Languages Python, Java, C++, MATLAB, SQL
ML / Data Scikit-learn, TensorFlow, Pandas, NLTK
Tools Tableau, Excel
UM Chess — Vice President

Maastricht, –present

Organised a blitz tournament for student members. Arranged a simultaneous exhibition with International Master Dr. Christian Seel — a format in which a single titled player moves between boards, playing multiple opponents at once. Conducts biweekly chess lessons for members with Candidate Master Michal Bodicky.

Phone (NL)
+31 6 87726719
Phone (IN)
+91 70109 70831