Hello,
I'm Mirosław Bober,
an aspiring
Big Data Engineer.

Building and scaling production Big Data systems.

--Big Data Engineer Profile
CREATETABLEbig_data_engineer(
nameVARCHAR(50)='Mirosław Bober',
titleVARCHAR(100)='Big Data Engineer',
skillsTEXT[]=ARRAY[
'MongoDB','Cassandra','Spark','SQL',
'Python','Docker','Linux','Kubernetes'
],
manages_clustersBOOLEAN=TRUE,
builds_pipelinesBOOLEAN=TRUE,
scales_to_petabytesBOOLEAN=TRUE
);
--Only hire if conditions met
SELECT*FROMbig_data_engineerWHERE
manages_clusters = TRUE AND
scales_to_petabytes = TRUE AND
'MongoDB' IN skills AND
CARDINALITY(skills) >= 5;

ABOUT ME

Who I am?

My name is Mirosław Bober. I am a hands-on Big Data engineer with a strong foundation in distributed databases, data pipeline architecture, and production-grade infrastructure. I specialise in designing and operating large-scale data systems built on MongoDB, Apache Cassandra, and Apache Spark. Comfortable working with bare-metal servers, Linux administration, rack-and-stack deployments, and containerised workloads in Docker. I'm proficient in SQL, Python, and pandas for data processing, and I thrive in environments where reliability, scalability, and performance matter most. I'm always looking for opportunities to build robust, data-intensive systems that power real business decisions.

Academic Foundations

Studied

Distributed Systems & The CAP Theorem

Deep theoretical understanding of consistency, availability, and partition tolerance trade-offs. Studied consensus protocols (Paxos, Raft) and their role in leader election and state replication.

Studied

Database Sharding & Replication Topologies

Academic focus on horizontal partitioning strategies (hash-based, range-based), resolving hotspots, data redistribution, and maintaining consistency across asynchronous read replicas.

Studied

NoSQL Data Modeling & Storage Engines

Theoretical knowledge of how data is physically stored on disk (B-Trees vs LSM-Trees), query planning, and optimizing schema design for read-heavy vs write-heavy distributed workloads.

Skills

MongoDB

PostgreSQL

MySQL

SQL

Python

Pandas

Numpy

Spark

Docker

Kubernetes

linux

Git

AWS

Nginx

pytorch

tensorflow

Memory Management

AI Agent Orchestration

ProjectsView All Projects

Big Data & Distributed Systems

Data pipelines, sharded databases, and scalable infrastructure architectures.

View Projects

Data Engineering

Data pipelines, data architecture and ETL processes.

View Projects

Other Projects

Various projects and experiments across many domains.

View Projects

Education

2023 - Present

Bachelor Degree in Computer Science

Specialization: Databases and Data Engineering

University of Economics in Katowice, Poland

2019 - 2023

IT & Computer Science Technical School

National Professional Qualification: IT Technician (EQF Level 4)

Professional Interests

Business Intelligence

Strategic use of data for decision-making.

Big Data Infrastructure

Designing distributed systems that store and process data at scale.

Data Engineering

Building reliable, scalable pipelines for data in motion and at rest.

Neural Networks

Designing layered computational models that mimic brain-like pattern recognition.

Cloud & DevOps

Deploying and automating infrastructure at scale.

DB Performance & Tuning

Squeezing maximum throughput out of databases under load.

CONTACT

Contact with me

If you have any questions or concerns, please don't hesitate to contact me. I am open to any work opportunities that align with my skills and interests.

Your Name:

Your Email:

Your Message:

miroslawbober4@gmail.com

Mysłowice, Poland

Hello, I'm Mirosław Bober, an aspiring Big Data Engineer.

Distributed Systems & The CAP Theorem

Database Sharding & Replication Topologies

NoSQL Data Modeling & Storage Engines

Big Data & Distributed Systems

Data Engineering

Other Projects

Hello,
I'm Mirosław Bober,
an aspiring
Big Data Engineer.