Crisis Cast
Built end-to-end data pipeline with Kafka and PySpark processing 8000+ crisis posts per hour from multiple APIs. Optimized Spark workflows reducing runtimes by 35% through in-memory caching and designed MongoDB schemas cutting query latency by 50%.