Articles

Invertibility 🇺🇸

May 04, 2019

In time series modeling, invertibility is the property of a model that allows the innovation process (also called the noise or disturbance process) to be expressed as a function of the observed series and its past values. This is particularly relevant for Moving Average (MA) models...

Negative Binomial Distribution 🇺🇸

April 27, 2019

Category: Statistics Notes

A discrete random variable X follows a negative binomial distribution if it represents the number of trials required to achieve a specified number of successes in a sequence of independent Bernoulli trials. The negative binomial distribution is often denoted as $X \sim \text{NegBinomial}(r, p)$, whe...

Cap Theorem 🇺🇸

April 26, 2019

Category: Databases Notes

The CAP Theorem states that a distributed system cannot simultaneously guarantee all three of the following properties...

Anomaly Detection 🇺🇸

April 15, 2019

Category: Stanford Machine Learning

Anomaly detection involves identifying data points that significantly differ from the majority of the data, often signaling unusual or suspicious activities. This technique is widely used across various domains, such as fraud detection, manufacturing, and system monitoring...

Logistic Regression 🇺🇸

April 14, 2019

Category: Stanford Machine Learning

Logistic regression is a statistical method used for classification in machine learning. Unlike linear regression, which predicts continuous values, logistic regression predicts discrete outcomes, like classifying an email as spam or not spam...

Wersje Pythona 🇵🇱

March 17, 2019

Category: Kurs Podstaw Pythona

Pyenv to potężne narzędzie open-source, które umożliwia programistom łatwe zarządzanie wieloma wersjami Pythona na jednym komputerze. Dzięki Pyenv można nie tylko instalować i przełączać się między różnymi wersjami Pythona, ale także izolować środowiska dla poszczególnych projektów. Jest to szczegól...

Petle 🇵🇱

March 15, 2019

Category: Kurs Podstaw Pythona

Pętle stanowią jeden z fundamentalnych elementów każdego języka programowania, umożliwiając wielokrotne wykonywanie wybranych instrukcji. Dzięki nim możemy powtarzać określone operacje na danych, co pozwala na znaczne uproszczenie i skrócenie kodu. W praktyce, bez pętli musielibyśmy wielokrotnie pow...

Custom Filters and Algorithms 🇺🇸

March 09, 2019

Category: Vtk Examples

Creating custom filters and algorithms in the Visualization Toolkit (VTK) opens up a world of possibilities for tailored data processing and visualization. By extending VTK's capabilities, you can carry out specialized techniques that meet the unique needs of your projects—whether it's for scientifi...

Dbanie o Jakosc Kodu 🇵🇱

February 26, 2019

Category: Kurs Podstaw Pythona

Kod może być składniowo poprawny, ale jednocześnie nieczytelny lub źle zorganizowany. Przestrzeganie pewnych standardów i konwencji pisania kodu jest niezbędne, zwłaszcza gdy w projekcie uczestniczy wielu programistów. Konwencje te opisane są w dokumentach PEP (Python Enhancement Proposals), a wśród...

Indexing Strategies 🇺🇸

February 18, 2019

Category: Databases Notes

Indexes play a crucial role in enhancing database query performance by allowing quick data retrieval without scanning every row in a table. Different indexing strategies are suited for various use cases and data types. Let's explore four common indexing strategies: B-tree, Bitmap, Hash, and Full-Tex...

Szablony 🇵🇱

January 31, 2019

Category: Od C Do Cpp

Szablony (ang. templates) stanowią fundament nowoczesnego programowania w języku C++. Są jednym z najbardziej potężnych narzędzi oferowanych przez ten język, umożliwiając programistom pisanie bardziej elastycznego i wielokrotnego użytku kodu. Dzięki szablonom, można tworzyć funkcje i klasy, które dz...

Logical Volume Management 🇺🇸

January 28, 2019

Category: Linux Notes

Think of data storage devices, such as DVDs, USB flash drives, and hard drives (HDDs or SSDs), as an entire cake. This cake can be cut into smaller slices or 'partitions'. These partitions are essentially divisions or sections within the storage device, helping to categorize or organize the storage ...

Stashing Files 🇺🇸

January 20, 2019

Category: Git Notes

In Git terminology, "stashing" refers to temporarily saving changes that are not ready to be committed. This allows you to switch branches or make other changes without losing your work...

Funkcje 🇵🇱

January 16, 2019

Category: Kurs Podstaw Pythona

Funkcje są blokami instrukcji zamkniętymi pod jedną nazwą i pozwalającymi na kontrolowanie z zewnątrz poprzez przekazywanie argumentów. Definicja funkcji polega na określeniu, które instrukcje należą do ciała funkcji, ile argumentów oczekuje funkcja oraz jaką nazwą będzie ona wywoływana w innych mie...

Type i and Type Ii Errors 🇺🇸

January 16, 2019

Category: Statistics Notes

Hypothesis testing is a core concept in statistics that allows researchers to evaluate assumptions about a population by examining sample data. In this process, we start with a null hypothesis, denoted by $H_0$, which represents a baseline or default position, and an alternative hypothesis, $H_a$, w...

Typ Wyliczeniowy 🇵🇱

January 14, 2019

Category: Od C Do Cpp

Typ wyliczeniowy enum w C++ umożliwia tworzenie zmiennych mogących przyjmować tylko pewien, wstępnie określony zestaw wartości. Każda z tych wartości reprezentowana jest przez czytelną nazwę, co przyczynia się do zwiększenia czytelności kodu. Od C++11 wprowadzono enum class, który oferuje silniejsze...

Git Server 🇺🇸

January 13, 2019

Category: Git Notes

Setting up your own Git server allows you to manage your version control system in-house, giving you control over where repositories are stored and how access is managed. By hosting your own server, you can customize the environment to better fit your team’s workflow, implement specific security mea...

Time Series Modeling 🇺🇸

January 10, 2019

Category: Statistics Notes

Time series modeling involves analyzing data points collected or recorded at specific time intervals to understand underlying structures and make forecasts. Various models, such as Autoregressive (AR), Moving Average (MA), and their combinations (ARMA, ARIMA), are employed to capture different aspec...

Data Definition Language Ddl 🇺🇸

December 12, 2018

Category: Databases Notes

Welcome to the world of Data Definition Language, or DDL for short. If you've ever wondered how databases are structured and how those structures are created and modified, you're in the right place. DDL is a subset of SQL (Structured Query Language) that focuses on defining and managing the schema o...

Capacity Planning 🇺🇸

November 27, 2018

Category: Databases Notes

Capacity planning is the strategic process of determining the necessary resources required to meet current and future demands of an application or system. It involves analyzing workloads, forecasting growth, and ensuring that the infrastructure can handle anticipated loads while maintaining optimal ...

Cubic Spline Interpolation 🇺🇸

November 10, 2018

Category: Numerical Methods

Cubic spline interpolation is a refined mathematical tool frequently used within numerical analysis. It's an approximation technique that employs piecewise cubic polynomials, collectively forming a cubic spline. These cubic polynomials are specifically engineered to pass through a defined set of dat...

Finding Files 🇺🇸

November 09, 2018

Category: Linux Notes

The find, locate, and which commands are commonly used for file search operations. The find command performs a comprehensive search using attributes such as name, size, and type. locate provides a faster, albeit periodically updated, search by filename. which locates the path of a program's executab...

Nosql Databases Intro 🇺🇸

November 07, 2018

Category: Databases Notes

NoSQL (Not Only SQL) databases are non-relational data storage systems that offer flexible schemas and scalable performance for handling large volumes of unstructured or semi-structured data. Unlike traditional relational databases that use tables and fixed schemas, NoSQL databases accommodate a wid...

Difference Equations 🇺🇸

November 06, 2018

Category: Statistics Notes

A difference equation (also known as a recurrence relation) defines each term of a sequence based on previous terms. In some cases, the general term of a sequence is given explicitly (e.g., $a_n = 3n + 2$, resulting in the sequence $5, 8, 11, \dots$). However, more commonly, a difference equation pr...

Observing Repository 🇺🇸

October 19, 2018

Category: Git Notes

Git offers several ways to inspect and understand what has changed in your codebase. Mastering these commands helps you monitor progress, spot issues early, and keep your project history organized. Think of it like reading the "track changes" feature in a word processor—but for your entire code proj...

Hardware 🇺🇸

October 18, 2018

Category: Linux Notes

Linux is a known for its ability to run on a broad range of hardware, from desktops and servers to embedded systems and IoT devices. Its modular kernel design allows efficient hardware management, enabling Linux to support various processors, GPUs, storage devices, and peripherals. With a vast colle...

Multithreading 🇺🇸

October 09, 2018

Category: Parallel And Concurrent Programming

Multithreading refers to the capability of a CPU, or a single core within a multi-core processor, to execute multiple threads concurrently. A thread is the smallest unit of processing that can be scheduled by an operating system. In a multithreaded environment, a program, or process, can perform mul...

Programowanie Funkcyjne 🇵🇱

September 24, 2018

Category: Kurs Podstaw Pythona

Programowanie funkcyjne, znane również pod angielską nazwą functional programming, to paradygmat programowania, który może wydawać się nieco odmienny od tradycyjnych metod. Zamiast skupiać się na sekwencji kroków i zmianie stanu programu, jak to ma miejsce w programowaniu imperatywnym, programowanie...

Liczby Losowe 🇵🇱

September 19, 2018

Category: Kurs Podstaw Pythona

Liczby losowe odgrywają kluczową rolę w wielu obszarach nauki, technologii i przemysłu, takich jak symulacje komputerowe, gry, analiza statystyczna, uczenie maszynowe, a także w badaniach fizycznych i matematycznych. W Pythonie za generowanie liczb losowych odpowiada moduł random, który zapewnia sze...

C vs Cpp 🇵🇱

September 15, 2018

Category: Od C Do Cpp

C i C++ to dwa języki programowania o wspólnych korzeniach, które odgrywają kluczowe role w dziedzinie informatyki. Chociaż C++ jest często określany jako rozszerzenie C, różnice między nimi są na tyle znaczące, że warto je szczegółowo omówić. W poniższym tekście przedstawimy dogłębną analizę obu ję...

Utilities 🇺🇸

September 01, 2018

Category: Linux Notes

We will discuss various tools that can be used on Linux systems for tasks such as taking screenshots, recording screens, preparing bootable sticks, and detecting malware. It provides brief explanations of each tool and includes installation and usage instructions...

Lambdy 🇵🇱

August 20, 2018

Category: Od C Do Cpp

Funkcje lambda, wprowadzone w standardzie C++11, stanowią jedno z najbardziej przełomowych rozszerzeń języka, umożliwiając tworzenie funkcji anonimowych bezpośrednio w miejscu ich użycia. Pozwalają one na definiowanie funkcji w sposób zwięzły i elastyczny, co znacząco ułatwia programowanie funkcyjne...

Procesy 🇵🇱

August 08, 2018

Category: Kurs Podstaw Pythona

Procesy to samodzielne jednostki wykonywane w systemie operacyjnym, każda z własną przestrzenią adresową i zasobami. Każdy proces działa niezależnie i jest izolowany od innych procesów. W związku z tym, komunikacja między procesami wymaga specjalnych mechanizmów, takich jak kolejki czy potoki. Proce...

Statistical Moments and Time Series 🇺🇸

August 08, 2018

Category: Statistics Notes

Understanding the behavior of time series data is helpful in various fields such as finance, economics, and engineering. Statistical moments, particularly the mean and standard deviation, play an important role in characterizing these processes. This section delves into how these moments describe ti...

Normal Curve and z Score 🇺🇸

August 05, 2018

Category: Statistics Notes

A normal distribution (often referred to as the normal curve or Gaussian distribution) is a continuous probability distribution that is symmetric about the mean, where most of the observations cluster around the central peak and taper off symmetrically towards both ends. Many real-world datasets suc...