Articles

Reshaping Arrays 🇺🇸

In data manipulation and analysis, adjusting the shape or dimensionality of arrays and matrices is a common task. Reshaping allows you to reorganize data without altering its underlying values, making it suitable for various applications such as data preprocessing, machine learning model input prepa...

Kod Bajtowy (Bytecode) 🇵🇱

Bytecode in Python is an intermediate, low-level representation of source code that the Python Virtual Machine (PVM) understands. When we run a Python script, the interpreter does not execute the source code directly; instead, it first compi...

Linear Equations 🇺🇸

Systems of linear equations are a cornerstone of linear algebra and play a crucial role in various fields such as engineering, physics, computer science, and economics. These systems involve multiple linear equations that share common variables. By utilizing matrix notation, we can represent and sol...

Partitioning vs Sharding 🇺🇸

When a database begins to sag under the weight of its own success, engineers reach for two closely related remedies: partitioning and sharding. Both techniques carve a huge dataset into smaller slices, yet they do so at very different depths of the stack. By the time you finish these notes you shoul...

Aggregate Functions 🇺🇸

Aggregate functions in SQL are powerful tools that allow you to perform calculations on a set of values to return a single scalar value. They are commonly used with the GROUP BY clause to group rows that share a common attribute and then perform calculations on each group. Aggregate functions are es...

Tar and Gzip 🇺🇸

Working with files on Unix-based systems often involves managing multiple files and directories, especially when it comes to storage or transferring data. Tools like tar and gzip are invaluable for packaging and compressing files efficiently. Understanding how to use these commands can simplify task...

Observing Repository 🇺🇸

Git offers several ways to inspect and understand what has changed in your codebase. Mastering these commands helps you monitor progress, spot issues early, and keep your project history organized. Think of it like reading the "track changes" feature in a word processor, but for your entire code proj...

Forward Difference 🇺🇸

The forward difference method is a fundamental finite difference technique utilized for approximating the derivatives of functions. Unlike the central and backward difference methods, which use information from both sides or preceding points, respectively, the forward difference method relies solely...

Cron Jobs 🇺🇸

Cron is a powerful utility in Unix-like operating systems that automates the execution of scripts or commands at specified times, dates, or intervals. It is used for tasks such as system maintenance, backups, updates, and more...

Log Files and Journals 🇺🇸

Understanding how logging works in Linux is like learning the language your system uses to communicate. Logs are the detailed records that your system keeps about its activities, and they are invaluable for troubleshooting, monitoring performance, and ensuring security. Let's embark on a journey to ...

Partitioning 🇺🇸

Partitioning involves dividing a large database table into smaller, more manageable pieces called partitions. This method helps improve query performance because the database can access only the relevant partitions when executing queries, rather than scanning the entire table. It also simplifies dat...

Inodes and Symlinks 🇺🇸

Inodes are critical as they store essential metadata about files, such as permissions and locations, allowing efficient file system management. Hard links are important because they let multiple file names point to the same inode, saving disk space by avoiding data duplication. Symlinks provide flex...

Archive 🇺🇸

git archive is your clean-room packager. It snapshots exactly what Git tracks at a commit: no .git folder, no stray build junk, no temp files. This means you can hand someone a tidy source bundle or ship code to a server without dragging history along...

Denormalization 🇺🇸

Denormalization might seem counterintuitive, especially if you're familiar with the principles of normalization that aim to reduce redundancy and dependency in databases. However, denormalization is a strategic process where we intentionally introduce redundancy into a database design. This approach...

Neural Networks Representation 🇺🇸

Neural networks represent a cornerstone in the field of machine learning, drawing inspiration from neurological processes within the human brain. These networks excel in processing complex datasets with numerous features, transcending traditional methods like logistic regression in both scalability ...

Searching, Filtering, and Sorting 🇺🇸

NumPy provides a set of functions for searching, filtering, and sorting arrays. These operations are helpful for efficiently managing and preprocessing large datasets, enabling you to extract meaningful information, organize data, and prepare it for further analysis or machine learning tasks. This g...

Dziedziczenie i Kompozycja (Inheritance and Composition) 🇵🇱

Inheritance and composition are two pillars of object-oriented programming that let us write flexible, modular, and maintainable code. Each approach brings its own advantages and comes with specific limitations, so the decision to use one or the other should follow from the charac...

Pętle (Loops) 🇵🇱

Loops are one of the fundamental elements of every programming language, making it possible to execute selected instructions repeatedly. With them we can repeat specific operations on data, which considerably simplifies and shortens code. In practice, without loops we would have to repeatedly...

Normalization 🇺🇸

Database normalization is a systematic approach to organizing data in a relational database. By minimizing redundancy and ensuring data integrity, normalization helps in efficiently structuring databases. The process addresses issues that arise when the same data is stored in multiple places, which ...

Covariance 🇺🇸

Covariance is a fundamental statistical measure that quantifies the degree to which two random variables change together. It indicates the direction of the linear relationship between variables...

Null Hypothesis 🇺🇸

Statistical hypothesis testing is a method used in research to make inferences about populations based on sample data. Understanding the concepts of null and alternative hypotheses, as well as how to calculate and interpret p-values, is crucial for conducting robust and meaningful analyses. This sec...

Informacje o Systemie Operacyjnym (Operating System Information) 🇵🇱

Working with the operating system is an integral part of building applications and scripts in Python. The os module from the standard library provides a rich set of functions for interacting with the operating system in a portable, platform-independent way. This lets us obtain informa...

Grep 🇺🇸

The grep command is one of the most powerful and versatile tools in the Unix and Unix-like operating systems, including Linux and macOS. Its name stands for global regular expression print, and it is primarily used for searching plain-text data sets for lines that match a regular expression or a fix...

Inverse Matrix 🇺🇸

The inverse of a matrix $A$ is denoted as $A^{-1}$. It is the unique matrix such that when it is multiplied by the original matrix $A$, the result is the identity matrix $I$. Mathematically, this is expressed as...

Interpolation 🇺🇸

Interpolation is the problem of reconstructing an unknown function from a finite set of exact data pairs...

Durability 🇺🇸

Durability is a fundamental principle in database systems that ensures once a transaction has been committed, its effects are permanent and will survive any subsequent system failures. This means that the data changes made by a transaction are safely stored and can be recovered even if the system cr...

Sharding 🇺🇸

Sharding is a method of horizontally partitioning data in a database, so that each shard contains a unique subset of the data. This approach allows a database to scale by distributing data across multiple servers or clusters, effectively handling large datasets and high traffic loads...

Jupyter Notebooks 🇵🇱

Jupyter Notebooks is an advanced working environment for creating and sharing interactive documents that combine code with rich multimedia content such as text, charts, animations, and even interactive elements. Although most often associated with the Python language, Jupyter...

Databases Intro 🇺🇸

Databases are the backbone of modern applications, serving as organized repositories where data is stored, managed, and retrieved efficiently. Think of a database as a digital library where information is cataloged systematically, making it easy to find and use. Whether it's a simple contact list on...

Evaluating Performance 🇺🇸

Evaluating the performance of parallel computing systems is crucial for understanding their efficiency and identifying potential bottlenecks. Here are some metrics and concepts for evaluating performance...

Data Definition Language (DDL) 🇺🇸

Welcome to the world of Data Definition Language, or DDL for short. If you've ever wondered how databases are structured and how those structures are created and modified, you're in the right place. DDL is a subset of SQL (Structured Query Language) that focuses on defining and managing the schema o...

Disk Usage 🇺🇸

Managing and monitoring disk usage is necessary for server maintenance, allowing administrators to identify disk space shortages caused by large log files, such as Apache or system logs, and malfunctioning applications that generate excessive data. Tools like df provide quick overviews of available ...

Random Walk 🇺🇸

The random walk is a fundamental and widely used time series model, often applied in finance to represent stock prices and other economic indicators. The idea behind the random walk is that the value of the process at time $t$ is the sum of its value at time $t-1$ and a random shock (or noise). Esse...

Stashing Files 🇺🇸

In Git terminology, "stashing" refers to temporarily saving changes that are not ready to be committed. This allows you to switch branches or make other changes without losing your work...

Regularization 🇺🇸

Regularization is a technique used to prevent overfitting in machine learning models, ensuring they perform well not only on the training data but also on new, unseen data...