Distributed Systems & The CAP Theorem
Deep theoretical understanding of consistency, availability, and partition tolerance trade-offs. Studied consensus protocols (Paxos, Raft) and their role in leader election and state replication.
Building and scaling production Big Data systems.
--Big Data Engineer ProfileCREATETABLEbig_data_engineer(nameVARCHAR(50)='Mirosław Bober',titleVARCHAR(100)='Big Data Engineer',skillsTEXT[]=ARRAY[
'MongoDB','Cassandra','Spark','SQL',
'Python','Docker','Linux','Kubernetes'
],manages_clustersBOOLEAN=TRUE,builds_pipelinesBOOLEAN=TRUE,scales_to_petabytesBOOLEAN=TRUE);--Only hire if conditions metSELECT*FROMbig_data_engineerWHEREmanages_clusters = TRUE AND
scales_to_petabytes = TRUE AND
'MongoDB' IN skills AND
CARDINALITY(skills) >= 5;Who I am?
My name is Mirosław Bober. I am a hands-on Big Data engineer with a strong foundation in distributed databases, data pipeline architecture, and production-grade infrastructure. I specialise in designing and operating large-scale data systems built on MongoDB, Apache Cassandra, and Apache Spark. Comfortable working with bare-metal servers, Linux administration, rack-and-stack deployments, and containerised workloads in Docker. I'm proficient in SQL, Python, and pandas for data processing, and I thrive in environments where reliability, scalability, and performance matter most. I'm always looking for opportunities to build robust, data-intensive systems that power real business decisions.

Deep theoretical understanding of consistency, availability, and partition tolerance trade-offs. Studied consensus protocols (Paxos, Raft) and their role in leader election and state replication.
Academic focus on horizontal partitioning strategies (hash-based, range-based), resolving hotspots, data redistribution, and maintaining consistency across asynchronous read replicas.
Theoretical knowledge of how data is physically stored on disk (B-Trees vs LSM-Trees), query planning, and optimizing schema design for read-heavy vs write-heavy distributed workloads.
MongoDB
PostgreSQL
MySQL
SQL
Python
Pandas
Numpy
Spark
Docker
Kubernetes
linux
Git
AWS
Nginx
pytorch
tensorflow
Memory Management
AI Agent Orchestration
2023 - Present
Bachelor Degree in Computer Science
Specialization: Databases and Data Engineering
University of Economics in Katowice, Poland
2019 - 2023
IT & Computer Science Technical School
National Professional Qualification: IT Technician (EQF Level 4)
Business Intelligence
Strategic use of data for decision-making.
Big Data Infrastructure
Designing distributed systems that store and process data at scale.
Data Engineering
Building reliable, scalable pipelines for data in motion and at rest.
Neural Networks
Designing layered computational models that mimic brain-like pattern recognition.
Cloud & DevOps
Deploying and automating infrastructure at scale.
DB Performance & Tuning
Squeezing maximum throughput out of databases under load.
© Portfolio by Mirosław Bober
2026