Water Tech Jobs

Build your career. Transform the future of water.

Senior ML Researcher - Foundation Models

SatSure

Software Engineering, Data Science

Bengaluru, Karnataka, India

Posted on May 15, 2026

About SatSure

SatSure is a deep-tech decision intelligence company operating at the nexus of agriculture, infrastructure, and climate action. We turn earth observation data into actionable insights for governments, financial institutions, and enterprises across the developing world — at scale, with reliability.

Role

You will be the architect of the model’s latent space, designing foundation models for multi-spectral, multi-temporal, and multi-resolution geospatial data.

This is a hands-on role involving prototyping, experimentation, and large-scale training. You will work across representation learning, model scaling, and spatiotemporal modeling to build systems that generalize across sensors, geographies, and time.

Responsibilities

Representation Learning

Design and implement self-supervised learning (SSL) objectives (e.g., Masked Autoencoders, DINO-style methods, contrastive learning) tailored for geospatial data
Develop multi-modal representations spanning optical, SAR, elevation, and derived signals
Ensure representations transfer effectively across tasks such as segmentation, classification, and change detection
Design evaluation strategies to measure generalization across geographies, sensors, and time

Model Development & Scaling

Design and scale models based on Vision Transformers (ViT), hybrid architectures, or State Space Models (e.g., Mamba) to large parameter regimes
Apply modern training techniques such as RMSNorm, FlashAttention, mixed precision, and gradient checkpointing
Run scaling experiments, ablations, and architecture explorations grounded in empirical rigor
Leverage insights from scaling behavior to make compute-efficient decisions across model size, data, and training strategy

Temporal Dynamics

Develop methods to model time-series satellite data, capturing:
- Seasonal patterns
- Temporal dependencies
- Long-term land-use changes
Explore sequence modeling, memory mechanisms, and temporal tokenization strategies

Systems-Level Thinking

Design ML systems as end-to-end pipelines (data ingestion → curation → training → evaluation → deployment → feedback)
Make explicit trade-offs between model quality, latency, cost, and data freshness
Work with platform teams to optimize:
- Distributed training (FSDP, DeepSpeed)
- GPU utilization
- Data pipelines and experiment throughput
Build reusable components and abstractions, not one-off models

Preferred Background

Experience

5–8 years of experience in ML research or applied research roles
Experience in large-scale foundation model development (vision, multimodal, speech, or related domains)
Experience training and/or fine-tuning billion-parameter models
Experience working with sequence, video, or temporal data
Exposure to geospatial foundation models such as:
- Prithvi
- Clay
- Segment Anything Model (SAM) (nice to have)

Technical Skills

Expert-level proficiency in PyTorch or JAX
Strong experience with:
- Distributed training (FSDP / DeepSpeed)
- Large-scale datasets and training pipelines
Familiarity with transformer architectures and training dynamics
Bonus: CUDA / performance optimization experience

Additional Strengths

Familiarity with efficient scaling techniques (e.g., Mixture of Experts) is a plus
Strong experimental rigor and ability to design meaningful ablations
Track record of publishing or contributing to state-of-the-art research in representation learning or generative modeling

See more open positions at SatSure