// data scientist & technologist

Blake Allen

AI · Machine Learning · Conservation Tech

Building intelligent systems at the intersection of healthcare, artificial intelligence, and planetary conservation — from clinical data pipelines to protecting sea turtles on the Ecuadorian coast.

Get in Touch
Python Machine Learning LLMs Full-Stack Healthcare Data Conservation Nanotech

Science, Code & Impact

I'm a data scientist and engineer driven by a single question: how can technology make the world meaningfully better? That curiosity has led me from building AI-powered healthcare platforms to launching a 501(c)(3) on the Ecuadorian coast to protect sea turtles.

My work lives at the frontier — wrangling messy clinical datasets, designing machine learning pipelines, and exploring emerging fields like nanotechnology and autonomous AI agents. I believe the most important problems — human health and planetary health — deserve the best tools we have.

When I'm not deep in a Jupyter notebook or architecting a backend, you'll find me following breakthroughs in molecular biology, conservation genetics, and AI alignment.

5+
Years in ML & Data Science
501(c)(3)
Founder, Ecuadorian Wildlife
Curiosity for Science
🐢
Sea Turtles Protected

Technical Toolkit

AI & Machine Learning
PyTorch TensorFlow scikit-learn LLMs NLP Computer Vision RAG Agents
Languages
Python SQL TypeScript JavaScript R Bash
Data Engineering
Pandas Spark dbt Airflow PostgreSQL Snowflake Kafka
Full-Stack Dev
React FastAPI Node.js Docker AWS GCP
Healthcare
FHIR HL7 Clinical NLP EHR Integration HIPAA
Emerging & Frontier
Nanotechnology Genomics AI Alignment Conservation Tech Synthetic Biology

Professional Journey

2024 — Present
Founder & Executive Director
Punta Tortuga Environmental Alliance · 501(c)(3)

Founded and built a nonprofit from the ground up to protect endangered wildlife and coastal ecosystems along the Esmeraldas coast of Ecuador.

  • Established the legal entity as a U.S. 501(c)(3) nonprofit, navigating all regulatory and compliance requirements
  • Led direct funding campaigns that resulted in the purchase of over 70 acres of prime Ecuadorian jungle and sea turtle nesting habitat — securing a permanent conservation legacy
  • Built and managed all technical infrastructure: organizational email systems, donation processing platforms, donor management, and a public-facing web presence
  • Coordinated on-the-ground conservation operations in partnership with puntatortuga.org and environmental stakeholders in Esmeraldas
2023 — 2024
Data Scientist & Co-Founder
Sym.ai · Generative AI

Co-founded an AI company focused on building personalized generative AI systems designed to support individuals in learning, growth, and well-being.

  • Designed and built personalized AI coaching systems using LLMs, leveraging user psychology and personality profiles to tailor experiences
  • Developed holistic support frameworks integrating generative AI with evidence-based models in learning science and behavioral psychology
  • Led the ML architecture for real-time personalization — adapting content, tone, and recommendations based on individual user context
  • Built full-stack platform features across data pipelines, model serving, and user-facing applications
2021 — 2023
Principal Data Scientist
LucidLane · Healthcare Technology

Architected AI-driven clinical decision support systems that improve patient outcomes in high-acuity care settings.

  • Designed NLP models for clinical note extraction and adverse event prediction using transformer architectures
  • Led a cross-functional engineering team delivering a HIPAA-compliant data platform on AWS
  • Developed LLM-powered tools to surface actionable care recommendations for clinical teams
  • Built end-to-end ML pipelines capable of processing millions of clinical records, reducing time-to-insight
  • Established MLOps practices including model monitoring, drift detection, and automated retraining
2019 — 2021
Data Scientist & Full-Stack Engineer
Independent / Consulting

Delivered data science and software solutions for clients across healthcare, finance, and environmental sectors.

  • Built predictive models and dashboards for client-facing analytics products
  • Developed React + Python web applications from design through deployment
  • Applied ML to environmental datasets to identify conservation-critical habitat zones
2017 — 2019
Data Analyst & Research Engineer
Research & Early Career

Developed skills in statistical modeling, scientific computing, and research methodology.

  • Built Python data pipelines for large-scale scientific data processing
  • Applied statistical methods to biological and environmental research datasets
  • Contributed to open-source scientific computing projects
2013 — 2017
Co-Founder & Full-Stack Engineer
Metamind & Futureloop (daily.ai) · Two Acquisitions

Launched a full-stack engineering career immediately after graduating, co-founding one early-stage startup and joining another as a founding member. Both were later acquired.

  • Co-founded Metamind, building the product and engineering foundation from the ground up; the company was subsequently acquired
  • Joined Futureloop as a founding member; the AI-powered information curation platform evolved into daily.ai and was also acquired
  • Gained deep early-career experience across the full stack — frontend, backend, APIs, databases, and cloud infrastructure
  • Operated in fast-moving startup environments, wearing many hats from product engineering to technical strategy

Academic Foundation

2020 — 2021
Master of Information and Data Science
University of California, Berkeley

Intensive graduate program focused on the full data science stack — from statistical theory and machine learning to scalable systems and data ethics.

  • Coursework in machine learning, natural language processing, and applied statistics
  • Deep focus on scalable ML systems, Bayesian inference, and research methodology
  • Capstone project applying NLP to healthcare clinical text analysis
2009 — 2013
B.S. Neuroscience  ·  Minor in Bioinformatics
University of California, Santa Cruz

Grounded in the biological sciences with early exposure to computational methods — a combination that would later define a career at the intersection of data science and life sciences.

  • Core studies in neurobiology, cellular physiology, and systems neuroscience
  • Bioinformatics minor covering sequence analysis, genomics, and computational biology
  • Research experience in laboratory settings applying quantitative methods to biological data

Beyond the Day Job

🐢
Ecuadorian Sea Turtle Conservation 501(c)(3)
Founded a nonprofit dedicated to protecting sea turtle nesting populations along Ecuador's Pacific coast. Deploying sensor networks, drone monitoring, and data-driven anti-poaching strategies to give these ancient species a fighting chance.
Founder · Active
🧬
AI for Conservation Genomics
Applying machine learning to eDNA and population genomics data to assess wildlife health and genetic diversity across threatened species corridors in South America.
Research · Ongoing
⚕️
Clinical LLM Research
Investigating fine-tuned large language models for clinical documentation, differential diagnosis support, and patient-facing health literacy tools — bridging the gap between AI capabilities and bedside care.
R&D · Healthcare AI
⚛️
Nanotechnology & Biomedical Futures
Tracking advances in nanoscale drug delivery, molecular machines, and targeted therapeutics. Exploring how computational modeling and AI will accelerate the design of nanomedical interventions.
Independent Study
🌊
Ocean Data Collective
Collaborating with marine biologists to build open data infrastructure for coastal ecosystem monitoring — aggregating sensor, satellite, and field observation data to track ocean health in near real-time.
Open Source · Collaboration
🤖
Agentic AI Systems
Building and experimenting with autonomous AI agent architectures — multi-agent workflows, tool-using LLMs, and long-horizon task planning — with a focus on safe, goal-directed behavior.
Exploration · AI Research

Let's Connect

Interested in collaborating on AI, healthcare, or conservation tech? I'm open to research partnerships, advisory roles, and mission-driven projects.