Introduction

The phrase "Garbage-In-Garbage-Out" is well known in data science, but it takes on new meaning in real-world projects, especially those involving satellite imagery. In these contexts, where ground-truth labels are often sparse, preprocessing is not just a step but a cornerstone of success. Reflecting on my project, I have realized that understanding and preparing the data accounts for about 70% (if not more) of the work and determines the quality of the results.

Data preprocessing ensures that the inputs to your model are clean, structured, and tailored to the problem at hand. This is true for all kinds of data, whether tabular, text, or images. However, the right preprocessing steps come from first "looking" at the data. In other words, preprocessing depends on both the data and the task.

Understanding Satellite Images

Satellite images are far more than just pictures; they encode a wealth of information about the spectral signature of a ...