Posts

Showing posts from May, 2026

Introducing the somhca R package

Complex datasets often contain patterns that are difficult to interpret using traditional statistical methods alone. One effective approach is to combine self-organizing maps (SOM) with hierarchical cluster analysis (HCA). Together, these techniques provide a powerful framework for exploring, visualizing and grouping high-dimensional data. SOM is an unsupervised neural network method that projects high-dimensional data onto a two-dimensional grid while preserving local relationships in the original data. Similar observations are positioned close to one another on the map, allowing patterns, relationships, trends and structures in complex datasets to become visually apparent, making SOM an excellent tool for dimensionality reduction and exploratory data analysis. However, SOM is not a clustering method by itself; it is primarily a topology-preserving mapping technique. When a large number of SOM units is used, similar observati...
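The SOM-then-HCA idea described above can be sketched in a few lines of R. This is only an illustration under stated assumptions: it uses the separate kohonen package and base R's hclust() rather than the somhca functions (whose signatures are not shown in this excerpt), and the 5 x 5 grid, the Ward linkage and the choice of three clusters are arbitrary values picked for the built-in iris data.

```r
# Illustrative sketch only: uses the kohonen package, NOT the somhca API.
library(kohonen)

set.seed(1)

# Numeric input matrix, standardized so every variable has equal weight.
X <- scale(as.matrix(iris[, 1:4]))

# Step 1: train a SOM on a 5 x 5 hexagonal grid (grid size is an assumption).
grid  <- somgrid(xdim = 5, ydim = 5, topo = "hexagonal")
model <- som(X, grid = grid, rlen = 200)

# Step 2: run hierarchical clustering on the SOM unit prototypes (codebook
# vectors), not on the raw observations.
codes <- getCodes(model)                        # one row per SOM unit
hc    <- hclust(dist(codes), method = "ward.D2")
unit_cluster <- cutree(hc, k = 3)               # cluster label per SOM unit

# Step 3: each observation inherits the cluster of its best-matching unit.
obs_cluster <- unit_cluster[model$unit.classif]
table(obs_cluster)
```

Clustering the unit prototypes instead of the observations is what lets the two methods complement each other: the SOM compresses the data onto the grid, and HCA then merges neighbouring units into a small number of interpretable groups.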

somhca Package – Part 1: Training and Visualizing Self-Organizing Maps in R

This is part 1 of a two-part series on the somhca R package. Start here for an introduction.

Overview

The four functions from the somhca R package presented in this post provide a complete workflow for preparing data, selecting an appropriate self-organizing map (SOM) configuration, training the SOM, and visualizing the resulting patterns. Together, they simplify the process of applying SOM-based exploratory analysis to high-dimensional numeric datasets such as spectra, sensor measurements, or other multivariate observations. These functions are particularly useful when working with complex datasets where pattern exploration, clustering, dimensionality reduction, and visualization are important goals. By automating tasks such as data preprocessing, SOM grid optimization, model training, and graphical interpretation, this workflow helps users build robust and reproducible SOM analyses with minimal manual tuning.

loadMatrix()

The loadMatrix() funct...

somhca Package – Part 2: Performing Hierarchical Cluster Analysis in R

This is part 2 of a two-part series on the somhca R package. Start here for an introduction.

Overview

The three functions from the somhca R package presented in this post provide a complete workflow for hierarchical cluster analysis (HCA), from grouping observations after SOM training or other types of dimensionality reduction and pattern extraction, to retrieving and exploring the results. Together, these functions help transform complex, high-dimensional datasets into interpretable groups that can be validated, visualized and used for further analysis.

Typical problems these functions help solve include: Simplifying interpretation of SOM results; Identifying natural groupings in complex datasets; Comparing clustering strategies (e.g., SOM-based vs PCA-based clustering); Detecting outliers or unusual observations; Assigning cluster labels for visualization or statistical analysis; Preparing grouped datasets for downstream machine learning or reporting.

c...
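As a point of comparison for the PCA-based strategy mentioned above, the following base R sketch runs HCA on principal component scores instead of SOM prototypes. It does not use the somhca functions; the iris data, the two retained components, the Ward linkage and k = 3 are all assumptions chosen for illustration.

```r
# Illustrative sketch only: base R (stats), NOT the somhca API.

# Standardize the numeric variables before PCA.
X <- scale(as.matrix(iris[, 1:4]))

# Reduce dimensionality with PCA and keep the first two components
# (the number of components is an assumption).
pca    <- prcomp(X)
scores <- pca$x[, 1:2]

# Hierarchical clustering on the PCA scores, cut into three groups.
hc       <- hclust(dist(scores), method = "ward.D2")
clusters <- cutree(hc, k = 3)

# Attach the labels to the original data for later plotting or reporting.
labelled <- data.frame(iris, cluster = factor(clusters))
table(labelled$cluster)
```

The resulting labels can be cross-tabulated against labels from a SOM-based run on the same data to see where the two strategies agree and where they split observations differently.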