Search company, investor...

Founded Year

2012

Stage

Unattributed | Alive

Total Raised

$167.35M

Last Raised

$16.77M | 2 yrs ago

Mosaic Score
The Mosaic Score is an algorithm that measures the overall financial health and market potential of private companies.

-77 points in the past 30 days

About Glia

Glia specializes in Unified Interaction Management, focusing on digital customer service technology for the financial services sector and beyond. The company offers a platform that facilitates seamless customer interactions across various channels such as messaging, video, voice, CoBrowsing, and AI automation, without the need for specific product names or technical jargon. Glia primarily serves the financial industry, including banking, credit unions, FinTech, insurance, and lending sectors. Glia was formerly known as Salemove. It was founded in 2012 and is based in New York, New York.

Headquarters Location

30 West 21st Street 4th Floor

New York, New York, 10010,

United States

781-612-2732

Loading...

Loading...

Expert Collections containing Glia

Expert Collections are analyst-curated lists that highlight the companies you need to know in the most important technology spaces.

Glia is included in 5 Expert Collections, including Unicorns- Billion Dollar Startups.

U

Unicorns- Billion Dollar Startups

1,244 items

F

Fintech

9,254 items

Companies and startups in this collection provide technology to streamline, improve, and transform financial services, products, and operations for individuals and businesses.

F

Fintech 100

749 items

250 of the most promising private companies applying a mix of software and technology to transform the financial services industry.

S

Sales & Customer Service Tech

579 items

Companies offering technology-driven solutions to enable, facilitate, and improve customer service across industries. This includes solutions pre-, during, and post-purchase of goods and services.

I

ITC Vegas 2024 - Exhibitors and Sponsors

627 items

As of 9/9/24. Company list source: ITC Vegas. Check ITC Vegas' website for any updates: https://events.clarionevents.com/InsureTech2024/Public/EventMap.aspx?shMode=E&ID=84001

Glia Patents

Glia has filed 20 patents.

The 3 most popular patent topics include:

  • clusters of differentiation
  • cytokines
  • neurological disorders
patents chart

Application Date

Grant Date

Title

Related Topics

Status

6/16/2017

8/11/2020

Web frameworks, Social networking services, Web applications, Web development, Social information processing

Grant

Application Date

6/16/2017

Grant Date

8/11/2020

Title

Related Topics

Web frameworks, Social networking services, Web applications, Web development, Social information processing

Status

Grant

Latest Glia News

STIE: Single-cell level deconvolution, convolution, and clustering in in situ capturing-based spatial transcriptomics

Aug 30, 2024

Abstract In in situ capturing-based spatial transcriptomics, spots of the same size and printed at fixed locations cannot precisely capture the randomly-located single cells, therefore inherently failing to profile transcriptome at the single-cell level. To this end, we present STIE, an Expectation Maximization algorithm that aligns the spatial transcriptome to its matched histology image-based nuclear morphology and recovers missing cells from ~70% gap area, thereby achieving the real single-cell level and whole-slide scale deconvolution, convolution, and clustering for both low- and high-resolution spots. STIE characterizes cell-type-specific gene expression and demonstrates outperforming concordance with true cell-type-specific transcriptomic signatures than the other spot- and subspot-level methods. Furthermore, STIE reveals the single-cell level insights, for instance, lower actual spot resolution than its reported spot size, unbiased evaluation of cell type colocalization, superior power of high-resolution spot in distinguishing nuanced cell types, and spatial cell-cell interactions at the single-cell level other than spot level. Introduction Recent rapid development in spatially resolved transcriptomics has enabled systematic characterization of cellular heterogeneity while retaining spatial context 1 , 2 , 3 , 4 , 5 , 6 , which is crucial for mapping the structural organization of tissues and facilitates mechanistic studies of cell-environment interactions. The available spatially resolved transcriptomic techniques have two major categories 7 : (i) in situ hybridization or in situ sequencing technologies, such as seqFISH 8 , MERFISH 9 , STARmap 3 , and FISSEQ 10 ; and (ii) in situ capturing technologies, such as spatial transcriptomics (ST) 4 , SLIDE-seq 2 , ZipSeq 11 , and HDST 12 . The first category achieved cellular or subcellular resolution by in situ visualization of predefined RNA targets, but the limited multiplex capacity hurdles them with only hundreds to thousands of genes profiled. In contrast, the second category captures transcripts in situ, followed by sequencing readout ex situ, thereby avoiding the limitations of direct visualization and allowing for an unbiased analysis of the complete transcriptome. The in situ capturing/barcoded area is referred to as “spot”. To date, the most predominant commercially available sequencing-based technique is 10X Visium, which is a spot-based spatial transcriptomics further developed from the ST technology in 2018. It is a higher throughput, more sensitive, and less custom protocol than the other alternative approaches, and very importantly, it supports both fresh frozen and formalin-fixed paraffin-embedded (FFPE) tissues. One of the primary technological limitations of the spot-based spatial transcriptomics resides in the low resolution of the spot, which often covers multiple cells. The current 10X Visium spot size is 55 μm in diameter, with the number of spot-covering cells ranging from 1 to 30, depending on the biological tissue 13 . To enhance the resolution, BayesSpace 14 proposed using Bayesian statistics to draw the neighborhood structure in spatial transcriptomic data and increase the resolution to the subspot level. Specifically, each ST spot was segmented into nine subspots, and each Visium spot was segmented into six subspots. Another method, called xfuse 15 , used a deep generative model to combine spatial transcriptome with histological image data, which can characterize the transcriptome of anatomical features in the micrometer resolution. Meanwhile, some other methods, such as stLearn 16 and SpaGCN 17 , have integrated the histology image to spatially smooth gene expression and improve clustering with contiguous spatial patterns. In addition to the computational enhancement, the resolution of spatial transcriptomics technique is improving itself, e.g., the new version of 10X Visium, under the name of “Visium HD” (claimed to be released in 2022 during this paper’s review), has been claimed to be released very recently with a much higher resolution. However, the spot resolution in cellular size does not equal single-cell level. Differing from single-cell sequencing technologies, the spots of the same size and fixed position cannot precisely capture the actual different-sized and randomly located single cells, which can still partially cover multiple cells in close proximity simultaneously, making spot-based spatial gene expression essentially fail to achieve the single-cell level regardless of spot size. This lack of single-cell level cannot be resolved by the current methods via only enhancing spot resolution computationally or technically, hindering the characterization of individual cellular spatial organization and cell-type-specific gene expression variation in spatial transcriptomics. In addition, on the Visium platform, 4992 spots of 55 μm in diameter are printed on the glass slide of \(6500\times 6500{{{{\rm{\mu }}}}{{{\rm{m}}}}}^{2}\) in area, leaving 100 μm distance between two spot centers and gene expression unmeasured from up to ~70% area \((1-\frac{4992\times \pi {\left(\frac{55}{2}\right)}^{2}}{{6500}^{2}})\). However, most current methods cannot fill this gap, resulting in a considerable loss of information. BayesSpace segmented the spot into subspot but cannot fill the gap area by only using the information from the spot, while xfuse combines spatial transcriptome with histological image to impute super-resolved expression within and between spots, although it is not the single-cell level. Third, compared with the fresh frozen tissue, the staining image from the FFPE tissue better preserves the cellular morphology, therefore facilitating more accurate cell/nucleus segmentation. However, the existing methods do not well utilize the matched image generated along with the expression profile by failing to take into account the cell segmentation, morphology features, and spatial location of every single cell captured by the image. To this end, we present STIE, which integrates spatial transcriptome data with matched histology image-based nucleus morphological information, to bridge the gap between spot resolution and single-cell level. By assuming that the cell type proportion of one spot inferred from spatial transcriptome and nucleus morphological features is similar, STIE uses the Expectation-Maximization (EM) algorithm to jointly model spatial transcriptome and nucleus morphology to find their consensus. Given cell-type transcriptomic signatures, STIE performed the single-cell level deconvolution and convolution for the low- and high-resolution spots, respectively; when provided no cell-type transcriptomic signatures, STIE performs single-cell level clustering. Moreover, STIE fills the gap between spots by recovering missing cells via neighborhood information. As such, STIE enables the real single-cell level and whole-slide scale analysis for spot-based spatial transcriptomics. Results Single-cell resolution does not equal single-cell level On the 10X Visium platform, the tissue sections are placed onto glass slides, where the spots of the same size are printed at fixed coordinates with barcoded Reverse Transcript (RT) primers. During tissue permeabilization, mRNA molecules diffuse vertically down to the solid surface and hybridize locally to the RT primers within the spot in situ. The cDNA-mRNA complexes are further extracted for library preparation and the sequencing readout. Different from single-cell RNA sequencing (scRNA-seq), which precisely captures each cell individually and profiles the gene expression of the whole cell, the spot at the predefined position is highly likely to partially cover multiple cells in close proximity simultaneously. However, these single cells cannot be resolved via only reducing spot size into cellular size computationally or technically, i.e., the single-cell resolution is not the single-cell level. We first took two recently proposed computational enhancement methods of single-cell resolution, BayesSpace 14 and xfuse 15 , as illustrative examples. We run BayesSpace on a 10X Visium mouse brain hippocampus data (Fig. 1a ). BayesSpace enhanced the ST resolution by segmenting one Visium spot into six subspots, and computationally impute the gene expression for each subspot. Because of smaller size, the subspot covers the cell in a small fraction, but due to the fixed size and location, it covers multiple cells simultaneously, therefore failing to achieve the single-cell level. Likewise, we investigated xfuse on a human breast cancer spatial transcriptomics data 4 (Supplementary Fig. 1 ), which enhanced the spot into even higher resolution, but it is still not the single-cell level due to the fixed size and location of the single enhancement unit. Fig. 1: Single-cell resolution is not single-cell level. a Computational resolution enhancement cannot achieve single-cell level. Illustration of the spot layout on the mouse brain hippocampus 10X Visium FFPE spatial transcriptome (left), the original gene expression summary at the spot level (middle), the BayesSpace-imputed gene expression summary at the subspot level along with the enlarged subspots on the H&E image (right). In the enlarged area, the white circle represents the original spot, and the red circle represents the enhanced subspot. b–e Systematic evaluation of spatial relationship between single cells and spots via simulation of high-resolution spots from real 10X Visium spatial transcriptomics FFPE data. b Nuclear morphological feature distribution of mouse kidney, mouse brain and human breast cancer. c Examples of simulated high-resolution spots on the real nuclear segmentation. The brown circle represents the high-resolution spot with 5 μm in diameter, and the black circle represents the nucleus in the real tissue. d The frequency of spots covering different numbers of cells, where the x-axis is the cell count covered by one single spot, and the y-axis represents the spot frequency. Only the spot that covers cells was considered. e The distribution of the cell area fraction covered by the spots in different diameters. In the box plots (b, e), center line represents median, lower and upper hinges represent first and third quartiles, whiskers extend from hinge to ±1.5 × IQR. The above distributions are drawn from 172,835 nuclei in mouse kidney, 31,546 nuclei in mouse brain, and 44,218 nuclei in human breast cancer, respectively. Source data are provided as a Source Data file. In addition to the computational imputation-based resolution enhancement, the resolution of spatial transcriptomics technique is also increasing itself. For example, the current spot size of 10X Visium is 55 μm in diameter, while the new version with enhanced resolution, called “Visium HD”, is claimed to be released very recently. However, it still holds a high chance to partially cover multiple cells. Next, we performed the cell segmentation on the spatial transcriptomics-matched H&E image and systematically investigated the spatial relationship between single cells and spots of different resolutions. We first explored a real high-resolution breast cancer HDST dataset 12 , which captures RNA from histological tissue sections on a dense, spatially barcoded bead array. However, like most of the current spatial transcriptomic techniques, HDST only supports the frozen tissue section, which is still more challenging to segment nuclei compared to FFPE (Supplementary Note  2 , Supplementary Fig. 2 ). Accurate nuclear segmentation is crucial here to understand the spatial relationship between spots and single cells. Therefore, we simulated the high-resolution spots from the real 10X Visium spatial transcriptome FFPE data. Three tissues, mouse brain, mouse kidney, and human breast cancer, were investigated to account for different cell sizes of cell types (Fig. 1b ). In addition to 55 μm, we simulated spots with diameters of 30 μm, 20 μm, 10 μm, and 5 μm to cover the whole tissue (Fig. 1c ), where the 5 μm diameter is even smaller than the average cell diameter of real tissues. With no loss of generality, we still left the gap area between spots. At 55 μm, each spot covered a median of 5–19 cells (Fig. 1d ; mouse kidney: 19, mouse brain cortex: 6, mouse brain hippocampus: 5, human breast cancer: 6), while with the spot size reduced to 5 μm, the cell count largely decreased to 1–2 cells (Fig. 1d ); mouse kidney: 2, mouse brain cortex: 1, mouse brain hippocampus: 1, human breast cancer: (1) Likewise, the cell area fraction covered by spots also largely decreases with the reduced spot size. At 55 μm, the cell area demonstrates a bimodal distribution, with the majority of cells fully covered by the spot, while at 5 μm, only ~5% of the cell area is covered by the spot by the median (Fig. 1e ). Of note, the high-resolution spot covered 1–2 cells by the median, but there were still a large number of spots covering multiple cells. Even at 5 μm, many spots covered at least two cells, which was consistent across the three tissues (Fig. 1d , mouse kidney: 62%, mouse brain hippocampus: 38%, mouse brain cortex: 29%, and human breast cancer: 37%). As such, one spot, even at a very small size, is still highly likely to cover multiple cells simultaneously with each cell covered at a very small fraction, making the spot still capture the mixed gene expression of cell types. Thus, the above gap between spot resolution and single-cell level raises the new computational challenges: “single-cell deconvolution” and “single-cell convolution” for low- and high-resolution spots, respectively. Differing from the traditional cell type deconvolution 18 , 19 , 20 that estimates cell-type proportion for spots, both single-cell deconvolution and convolution aim to resolve single cells from spatial transcriptomics spots, find their spatial location and identify their cell types, where the difference resides in that single-cell deconvolution refers to splitting the large low-resolution spot into single cells, while single-cell convolution refers to assembling the small pieces covered by adjacent high-resolution spots into one complete single cell. Overview of the STIE workflow To address the above challenges, we proposed an EM algorithm, called STIE, which used two pieces of information (Fig. 2 ), spot-level gene expression and matched histology image-based nucleus segmentation. STIE took as input (but is not limited to) the nucleus segmentation implemented in DeepImageJ 21 , called the “Multi-Organ Nucleus Segmentation model”. It was proposed by the “Multi-Organ Nucleus Segmentation Challenge 22 ” to train the generalized nucleus segmentation model in H&E stained tissue images. Next, the nuclear location along with the nuclear morphological features were extracted for every single nucleus. Fig. 2: Workflow of STIE by integrating spot-level spatial transcriptome with nuclear morphology. a Overview of STIE workflow. b Two modes of STIE: (1) single-cell deconvolution/convolution for low-/high-resolution spots and (2) single-cell clustering, i.e., signature-free single-cell deconvolution/convolution. c The flowchart of STIE single-cell deconvolution/convolution. The rationale of the STIE is that, given the cell type transcriptomic signature, the cell type proportion can be estimated from the gene expression of each spot, while a similar proportion can also be calculated from the histology image by segmenting cells/nuclei, extracting cell/nucleus morphological features, clustering single cells into groups, locating every single cell and calculating the cell cluster proportion covered by each spot. The gene expression-based and nuclear morphology-based cell type/cluster proportions (Supplementary Note  1 ) are theoretically the same, motivating us to model gene expression and nucleus morphology jointly, i.e., borrow information from nuclear morphology-based cell typing to refine gene expression-based cell type proportion estimation, followed by the other way around. Accordingly, we proposed STIE, an EM algorithm to model their joint likelihood as main loss function along with their difference in cell-type proportion estimation as penalty, thereby mutually drawing information and gradually refining each other to achieve their consensus. Information borrowing was implemented in Eqs. ( 4 ), ( 8 ) of the M-step (see “Methods”), which utilized both gene expression and nuclear morphology to refine the morphological model and the gene expression model, respectively. STIE has two modes: (1) Single-cell deconvolution/convolution: Given the cell type transcriptomic signature, STIE deconvolutes/convolutes the low-/high-resolution spots into single cells. The transcriptomic signature can be defined from the existing single-cell atlas of different tissues 23 , 24 , 25 . (2) Single-cell clustering (Signature-free single cell deconvolution/convolution): Given no cell type transcriptomic signature, STIE can perform clustering at the single-cell level. The clustering algorithm is built on deconvolution/convolution: given the number of clusters and the initial value of the clustering transcriptomic signature, STIE iteratively refines the clustering signature-based single-cell deconvolution/convolution and re-estimates the signature until the iteration converges. STIE has two hyperparameters, \(\lambda\) and \(\gamma\), representing the shrinkage penalty for nuclear morphology and the area surrounding the center of the spot, respectively. \(\lambda\) penalizes the difference between gene expression- and nuclear morphology-predicted cell type proportion, and accordingly, we chose \(\lambda\) for STIE via a heuristic strategy (Supplementary Information & Supplementary Fig. 3 ) to tweak the weights of contribution between gene expression and nuclear morphology to the final model fitting. Meanwhile, by modulating \(\gamma\), STIE aims to find the group of cells that are covered by the spot and can best fit the gene expression of the spot. Based on the observation on different real datasets, we set γ = 2.5x reported spot diameter for the 10X Visium datasets used in this paper (Supplementary Information), which gives the best concordance between the predicted and true spatial gene expression. Given no nuclear morphological features prespecified, STIE also selects the nuclear morphological features along with \(\lambda\) based on a grid search over the combinations of morphological features and \(\lambda\) (Supplementary Information & Supplementary Fig. 4 ). At last, the cells covered by the spot are assigned cell types based on both spot gene expression and cellular morphology, which also include those cells located outside the reported spot size but inside the enlarged spot area, \(\gamma\). Moreover, the cells located outside \(\gamma\) are assigned cell types using the cellular morphology and gene expression from their adjacent enlarged spot area. As such, STIE filled up the missing cells from the gap area between spots. We performed the simulation analysis by generating different combinatorial scenarios between nuclear morphology and spatial gene expression with either high or low noise. On the simulated dataset, we confirmed the convergence of STIE deconvolution (Supplementary Fig. 5 ) and evaluated the running time (Supplementary Fig. 6 ), which scales well with number of cell types and spots and is not significantly influenced by the number of marker genes. Moreover, we compared STIE with the other deconvolution methods, SPOTlight 18 , DWLS 19 , 26 , Stereoscope 27 , RCTD 20 , Tangram 28 and BayesPrism 29 , which only reply on gene expression (Supplementary Fig. 3 ). STIE outperformed all the others due to incorporation of nuclear morphology as additional information. Integration of gene expression and nuclear morphology enables single-cell level deconvolution/convolution in spatial transcriptomics We investigated the mouse brain 10X Visium FFPE spatial transcriptome (Fig. 3a–h , see the “Methods” section), and deconvoluted the hippocampal region based on scRNA-seq-derived mouse brain hippocampus cell-type transcriptomic signatures 30 . Three methods were first compared, SPOTlight 18 , BayesSpace 14 , and STIE, representing three categories of deconvolution methods, direct deconvolution, imputation-based deconvolution and our nuclear morphology and gene expression integration-based single-cell level deconvolution. First, SPOTlight 18 works on each spot as bulk gene expression and deconvoluted it into cell types at different proportions, achieving resolution at the spot level (Fig. 3a ). Second, BayesSpace 14 segmented each spot into subspots and used the information from spatial neighborhoods to impute subspot gene expression. Following up with SPOTlight, enhanced resolution of cell type deconvolution was achieved at the subspot level (Fig. 3b ). Third, by integrating the image with gene expression, STIE does not only identify the cell type for each single cell within the spot (Fig. 3c middle) but also recovers the cell outside the spot (Fig. 3c right), making the whole-slide-wide cell type deconvolution at the highest single-cell level (Fig. 3c left). Meanwhile, STIE learned the distribution of morphological features for each cell type, making the result more interpretable (Fig. 3j ). STIE primarily considered two categories of nuclear morphological features (but not limited to): size (Area, Major, Minor, Width, Height, Feret, and Perimeter) and shape (Round and Circular). The best features along with \(\lambda\) were selected by a grid search via evaluating the fitting of gene expression and nuclear morphology simultaneously (Supplementary Information; Supplementary Fig. 7 – 10 ). By comparing with the ground truth 25 , 30 (Fig. 3i ), STIE accurately found the position of all four cell types at the single-cell level. The other two methods found the cell types at certain proportions, but SPOTlight found high proportions of CA1 at all spots and failed to see CA2. Moreover, the estimated locations of cell types largely deviate from the truth. With the enhanced subspot resolution by BayesSpace, the follow-up SPOTlight more accurately estimated the positions of cell types, revealing the advantage of high resolution by BayesSpace; however, the CA2 proportion is still underestimated at the true CA2 locations but overestimated at the true CA1 locations. Besides SPOTlight, we compared five other methods, which only take gene expression as input to deconvolute cell types at spot level, including DWLS 19 , 26 , Stereoscope 27 , RCTD 20 , Tangram 28 and BayesPrism 29 (Fig. 3d–h , Supplementary Fig. 11 for their running time and memory usage). DWLS and RCTD showed better results than SPOTlight, with positions of cell types predicted more accurately. However, one of the shortcomings of the gene expression-based deconvolution method is that it cannot account for the cell count, therefore making misleading the high proportion of cell types within spots. For example, CA1 is primarily located in the pyramidal cell layer (Fig. 3i ), and the other layers have much fewer nuclei, but SPOTlight, DWLS and RCTD still suggest the existence of CA1 in those cell-sparse layers at certain proportions (Fig. 3a ). Second, the estimated proportion of one cell type within each spot is often accompanied by others (Fig. 3a, b, d–h ), but those locations have only one cell type as indicated by ISH (Fig. 3i ), such as the spots along the band of CA1, CA2, and CA3. This co-existence may result from the underlying co-linearity of the cell-type transcriptomic signatures (Supplementary Fig. 12 ). STIE addressed the above shortcomings by incorporating the nuclear segmentation and morphology as orthogonal features, facilitating better distinguishing among cell types. Fig. 3: Single-cell level deconvolution/convolution in spatial transcriptomics by STIE. a–j Mouse brain hippocampus 10X Visium FFPE spatial transcriptome. a Spot-level cell-type deconvolution using SPOTlight. In the enlarged area, each pie chart represents the proportion of cell types for the corresponding spot. b Subspot-level cell type deconvolution using BayesSpace followed by SPOTlight. In the enlarged area, each pie chart represents the proportion of cell types for the corresponding subspot. c Single-cell level deconvolution by STIE (left panel), which is the aggregation of cells captured by spots (middle panel) and cells missed by spots but recovered by STIE (right panel). In the enlarged area, the circle is the cell contour, with the color representing its cell type. Spot-level cell-type deconvolution using DWLS (d), Stereoscope (e), RCTD (f), Tangram (g), and BayesPrism (h). i Ground truth of mouse brain hippocampus cell types by In Situ Hybridization (ISH): CA1 (Mpped1), CA2 (Map3k15), CA3 (Cdh24) and DG (Prox1). The arrowhead indicates high expression. The figure is reproduced from Fig. 1D in ref. 30 . j Nuclear morphological feature distributions of cell types learned by STIE (422 CA1, 39 CA2, 115 CA3, 818 DG, and 872 Glia). k Single-cell level deconvolution by STIE on 10X Visium V2 Chemistry CytAssist FFPE spatial transcriptomics of two consecutive mouse brain hippocampus sections: section 1 (up panel) and section 2 (bottom panel). l–n Human breast cancer 10X Visium FFPE spatial transcriptome. l Single-cell level deconvolution by STIE. m Nuclear morphological feature distributions for cell types (2910 Bcells, 12,269 CAFs, 11,957 CancerEpithelial, 3120 Myeloid, 6997 Plasmablasts, 1920 PVL, and 4584 Tcells). Center line represents median, lower and upper hinges represent first and third quartiles, whiskers extend from hinge to ±1.5× IQR. n Manually annotated human breast cancer pathological regions. Single-cell convolution for the simulated high-resolution spatial transcriptomics data of the mouse brain hippocampus (o) and human breast cancer (p) using spots with 5 μm in diameter. Source data are provided as a Source Data file. To further investigate the accuracy and applicability, we tested STIE deconvolution on the recently released 10X Visium V2 Chemistry CytAssist mouse brain hippocampus FFPE spatial transcriptomics datasets. On both consecutive sections (Fig. 3k ), STIE deconvoluted the single cells accurately that are highly consistent with the ground truth. To investigate the influence of selection of marker genes on STIE deconvolution, we used SPOTlight to construct a new transcriptomic signature and use it to rerun STIE deconvolution. Although the new signature has only 39% overlapping genes with the original, STIE still deconvoluted the single cells accurately conferring high concordance with those using original signatures (Supplementary Fig. 13a–c ). Moreover, we randomly sampled the marker genes from the original signature and rerun STIE deconvolution to investigate its consistency with that using the full signature. As expected, the concordance is reduced with decreasing marker genes. However, with half marker genes, the concordance is still higher than 80%, except the CA2 that has few cells (Supplementary Fig. 13d ). In addition, we investigated the influence of image alignment to the STIE deconvolution, we gradually shifted the image to make it misaligned to the spot, rerun STIE, and evaluated its concordance with that under accurate alignment (Supplementary Fig. 14 ). As expected, the concordance is slowly reduced with the increasing misalignment of cells with spots. However, STIE still has 88% and 84% concordance with the accurate alignment, even if the cells are misplaced by a half spot and a whole spot, respectively. A slight misalignment, 10% spot size, does not significantly influence the cell typing (98% concordance). In addition, we investigated the human breast cancer 10X Visium FFPE spatial transcriptome dataset (Fig. 3l–n ). Based on the human breast cancer scRNA-seq-derived cell type transcriptomic signature 31 , we used STIE to deconvolute spots into single cells with cell typing, including Cancer and Normal epithelial cells, Cancer-associated fibroblast (CAF), Perivascular -like cells (PVL), Myeloids, B-cell and T-cell (Fig. 3l ), along with the morphological features characterized for each cell type (Fig. 3m ). Most cell types demonstrated distinct morphological features. Among immune cells, Myeloid is relatively easier to distinguish, showing a much larger nucleus size, whereas T-cell and B-cell are well-known to resemble morphologically, but STIE still estimated a slightly larger nucleus area of B-cell than T-cell, which is consistent with the previous observation 32 . We compared our result with the manual annotation by the pathologist (Fig. 3n ). Despite different resolutions, our single-cell deconvolution still shows a high resemblance to the annotated regions. On this dataset, we also tested the other deconvolution methods at the spot level, SPOTlight, DWLS, Stereoscope, RCTD, Tangram, and BayesPrism, to deconvolute the spot into cell type proportions (Supplementary Fig. 15 ). Overall, DWLS, Stereoscope, and RCTD predicted the distinct proportions of CancerEpithelial across spots, and the high-proportion spots tend to enrich for the tumor area, while SPOTlight, Tangram, and BayesPrism predicted more identical CancerEpithelial across all spots, at low, moderate, and high proportions, respectively. We specifically compared their predicted tumor area with the manual annotation (Supplementary Fig. 16a ) by evaluating the ARI and R2 (Supplementary Fig. 16b, c ). STIE gives the best concordance (ARI = 0.64, R2 = 0.64), which is slightly higher than SPOTlight (ARI = 0.54, R2 = 0.60), RCTD (ARI = 0.57, R2 = 0.6) and DWLS (ARI = 0.57, R2 = 0.59), and much higher than Stereoscope (ARI = 0.26, R2 = 0.13), Tangram (ARI = 0.38, R2 = 0.33), and BayesPrism (ARI = 0.4, R2 = 0.35). With the technical advancement, the spot resolution is also increasing itself, such as the 10X Visium HD. The high-resolution spot covering multiple cells at a small fraction can be similarly modeled using STIE. To test this hypothesis, we simulated the high-resolution spot mouse brain hippocampus spatial transcriptome data (Fig. 3o ; 5 μm in diameter) by using three pieces of information from the real low-resolution spot (Fig. 1a left; 55 μm in diameter), including the nuclear spatial location, STIE-deconvoluted cell types (Fig. 3c ), and the associated cell-type transcriptomic signature. Next, we used STIE to convolute the small pieces of cells from adjacent spots into single whole cells (Fig. 3o bottom) and found that the convoluted cell types were highly consistent with the true cell types by deconvolution. Likewise, we also simulated the high-resolution spot spatial transcriptomic data from the real human breast cancer low-resolution spot data (Fig. 3l ) and observed similarly high consistency between convoluted cell types (Fig. 3p ) and their original truth (Fig. 3l ). Despite no real high-resolution spot spatial transcriptomics FFPE data available so far, the above facts support the capability of STIE in convoluting small pieces of cells into single whole cells resulting from the high-resolution spot spatial transcriptomic data. STIE enhances spatial transcriptomic clustering to single-cell level Given cell type transcriptomic signatures, we deconvoluted/convoluted spots into single cells. As opposed, we checked if the cell type transcriptomic signature can be reversely estimated given cell typing (see “Methods”). We investigated the mouse brain hippocampus and human breast cancer (Fig. 4a, b , Supplementary Fig. 17 ) and found that all of our learned signatures correlated most with their corresponding scRNA-seq-derived signatures. Moreover, we used the non-negative least square (NNLS) to deconvolute our estimated signatures based on the scRNA-seq-derived signatures. Again, all of our learned signatures showed high proportions of the corresponding cell types. DWLS was used as an independent method and observed similar proportions. Despite small variation between the original scRNA-seq and the learned spatial transcriptomic signatures, the above results largely corroborated that we could estimate the single-cell level transcriptomic signature from the spot-level spatial transcriptome. Fig. 4: Single-cell level clustering in spatial transcriptomics by STIE. Cell type specific transcriptomic signature learning from 10X Visium mouse brain hippocampus FFPE (a) and 10X Visium human breast cancer FFPE (b). Spot-level clustering by K-means, SpaGCN, MUSE, subspot-level clustering by BayesSpace and single-cell-level clustering by STIE on 10X Visium FFPE mouse brain hippocampus (c), mouse brain cortex (e) and human breast cancer (g). Cell type deconvolution of spot-, subspot-, and single-cell-level clustering-derived CAGE in the mouse brain hippocampus (d), mouse brain cortex (f), and human breast cancer (h). For the mouse brain cortex, the cell types in the transcriptomic signature, which are not cortex layers and have small proportions, are not shown in the barplot. The box plot (h) represents the deconvoluted proportion of 9 cell types, where center line represents median, lower and upper hinges represent first and third quartiles, and whiskers extend from hinge to ±1.5 × IQR. The p-value was calculated based on one-sided Wilcoxon signed-rank test without adjustment for multiple comparisons. i The UMAP plot of human breast cancer scRNA-seq data from 26 primary tumors 31 . The top panel is the original cell typing of 10,060 single cells, and the bottom panel is the subset of cells that are mapped to the six STIE clusters. Spot-level clustering by K-means (left), SpaGCN (middle), and single-cell-level clustering by STIE (right) on the simulated high-resolution spot spatial transcriptome data of the mouse brain hippocampus (j) and human breast cancer (m). Cell type deconvolution of spot- and single-cell-level clustering-derived CAGE in the mouse brain hippocampus (k) and human breast cancer (n). l, o The consistency table of single-cell clusters between the simulated high-resolution spot-based STIE clustering and the original low-resolution spot-based STIE clustering as ground truth of the mouse brain hippocampus (c) and human breast cancer (g). Source data are provided as a Source Data file. Furthermore, we similarly estimated signatures from the whole transcriptome besides signature genes. On this basis, we implemented the STIE clustering algorithm when given neither cell typing nor transcriptomic signatures: we iteratively re-estimate the clustering transcriptomic signature and refine the clustering signature-based single-cell deconvolution/convolution until the iteration converges (see “Methods”). We tested this clustering algorithm on the mouse brain hippocampus (Fig. 4c ). As comparison, we also tested the other clustering methods on this dataset, including K-means that relies on only gene expression, SpaGCN 17 and MUSE 33 that also utilize the histological image, and BayesSpace at the sub-spot level (Fig. 4c ; Supplementary Fig. 18 – 21 ). At k = 5, STIE clustering was close to the ground truth and matched the cell types of CA1, CA2, CA3, DG, and Glia in STIE deconvolution. K-means accurately uncovered the clusters of CA1, CA3, DG, and Glia with CA2 missed. SpaGCN roughly found the CA1, DG, and Glia clusters. MUSE automatically determined the number of clusters k = 5 and found the CA1, DG, and Glia clusters. At the sub-spot level, BayesSpace clustering is more delicate with higher resolution, but it still failed to find the CA2 cluster even at a higher number of clusters (Supplementary Fig. 20 ). Furthermore, we calculated the average gene expression of each cluster (referred to as CAGE) and deconvoluted it based on the scRNA-seq-derived cell type transcriptomic signatures as another angle to evaluate the resolution of clustering methods (Fig. 4d ). The rationale is that if the cluster is single-cell level and all cells within the cluster belong to the same cell type, the deconvolution will demonstrate a high proportion of single-cell type for each cluster; otherwise, it will be the saturated proportions of multiple cell types. As shown in Fig. 4d , the CAGEs by other methods at spot-level or subspot-level all demonstrated a mixture of cell types. In particular, all of them showed a high proportion of Glia, suggesting that most spot/subspot clusters mix with Glia cells, which may mislead the biological interpretation of the cluster. In contrast, the CAGE by STIE (i.e., the STIE-estimated whole transcriptomic signature for single-cell clusters) recovers all cell type signatures at the single-cell type level, i.e., STIE found the matched clusters for all cell types, CA1, CA2, CA3, DG, and Glia, and each cluster gives the high proportion of its matched cell type by the cell type deconvolution. Likewise, we evaluated STIE clustering on the 10X CytAssist dataset of mouse brain hippocampus (Supplementary Fig. 22 ), which accurately uncover the clusters of CA1, CA2, CA3, and DG largely matching the ground truth. Moreover, we applied different clustering methods to the 10X Visium mouse brain cortex spatial transcriptome (Fig. 4e & Supplementary Fig. 23 – 26 ). At k = 6, we found clear layers by K-means, SpaGCN, BayesSpace, and STIE. Specifically, K-means at the spot level has thinner cluster 1 (black), cluster 2 (cyan), and cluster 6 (yellow), but thicker cluster 3 (red), while SpaGCN and BayesSpace showed more even clusters, especially cluster 1–3, but SpaGCN identified a much thicker cluster 6 (yellow) than the others. MUSE also identified 6 clusters automatically, but its clusters are more blurred than the other methods. STIE clustering takes the K-means at spot level as initial values, so it largely resembles K-means, but STIE showed more expanded cluster 1 (black) and cluster 6 (yellow). We further checked the cell type deconvolution based on the scRNA-seq signatures of the mouse brain cortex by Allen Brain Atlas 25 . Despite different patterns by NNLS and DWLS, both spot-level K-means, SpaGCN, MUSE, and subspot-level BayesSpace showed saturated proportions of cortex layers, whereas STIE demonstrated a significantly higher proportion for the layers from L2 to L6 (Fig. 4f ), which is also concordant with the consecutive spatial location of clusters. In addition, we tested STIE clustering on the mouse brain cortex 10X V2 Chemistry CytAssist FFPE datasets (Supplementary Fig. 27 ). On both section 1 and section 2, we observed the clear layers along with the good concordance between the CAGE and the scRNA-seq transcriptomic signatures. Further, to evaluate the impact of nuclear morphology on the STIE accuracy, we compared the above FFPE sections with the fresh frozen tissue section. We run STIE clustering at the single-cell level on the 10X Visium fresh frozen human dorsolateral prefrontal cortex 34 , which grouped the single cells into clear layers that resemble the manual annotation (Supplementary Fig. 28 ). By downgrading the single cells into spots, the clusters gave a moderate concordance with the manual annotation, which can be improved by more properly setting the initial values (Supplementary Fig. 29 ), but overall, it demonstrates lower performance and weaker robustness on the selection of nuclear morphological features compared to the 10X CytAssist FFPE mouse brain cortex (Supplementary Fig. 28 – 29 ). This is largely due to the low image quality of the fresh frozen section and the inaccurate nuclear segmentation (Supplementary Fig. 30 ), corroborating the importance of nuclear morphology to the accurate single cell clustering. Third, we investigated the 10X Visium human breast cancer FFPE spatial transcriptome (Fig. 4g & Supplementary Fig. 31 – 34 ). Overall, all K-means, BayesSpace, SpaGCN, and STIE demonstrated clear clusters, where the clustering at k = 6 demonstrates the best concordance with the STIE deconvolution (Fig. 3l ). By comparing with the manual annotation, their clusters are largely consistent with the tumor region (cluster 1 of K-means with R2 = 0.63, cluster 1&2 of BayesSpace with R2 = 0.49, cluster 1&2 of SpaGCN with R2 = 0.58, and cluster 1&2 of STIE with R2 = 0.64; Supplementary Fig. 35 ). Although MUSE identified 18 clusters automatically, it also uncovers clusters partially matching the tumor regions well (cluster 1–4 with R2 = 0.38). By checking the cell type deconvolution, cluster 1 and cluster 3 in the spot/subspot level clustering showed relatively higher proportions of CancerEpithelial and CAFs, respectively, while the other clusters all mixed with different cell types, especially CAFs. In contrast, STIE clusters showed much higher cell type proportions (Fig. 4h & Supplementary Fig. 36 ), therefore matching well with the known cell types: cluster 1 (14.3% out of all cells) for CancerEpithelial, cluster 2 (21.3%) for CancerEpithelial, cluster 3 (14.2%) for CAF, cluster 4 (35.8%) for Plasmablasts, cluster 5 (10.2%) for Myeloids, and cluster 6 (4.2%) for mixed T-cells, B-cells and Plasmablasts. Of note, different from the deconvolution (Fig. 3l ), STIE clustering identified two clusters for the tumor regions (cluster 1 and cluster 2), where cluster 2 was only located at the bottom tumor and the boundary of the other tumor regions. We aligned the clusters to ~100,000 human breast cancer single cells from 26 primary tumors 31 (Fig. 4i & Supplementary Fig. 37 ). SCINA 35 was used to assign cell clusters based on the clustering-derived transcriptomic signature. A total of 60.7% of cells were identified with known clusters assigned (Fig. 4i right), matching 6 cell types: CancerEpithelial, CAF, Plasmablasts, Myeloids, T-cells, and B-cells (Fig. 4i left). We found a group of closely neighboring clouds for cluster 2 (brown) in the big category of CancerEpithelial, which is separated from cluster 1 (steel blue), revealing a putative subtype of breast cancer. The CAGE of cluster 2 comprises a mixture of CAF and CancerEpithelial (Fig. 4h ). It may implicate the epithelial-mesenchymal transition (EMT) 36 , which is associated with tumor progression and migration. STIE has shown good clustering performance on the current spot size, i.e., 55 μm in diameter. Next, we tested whether STIE can also achieve single-cell level clustering on the high-resolution spot. We simulated the high-resolution spot spatial transcriptome from the 5 clusters in the mouse brain hippocampus (Fig. 4c ) and checked whether spot-level K-means, SpaGCN, MUSE, and single-cell-level STIE can recover those 5 clusters. We did not test the subspot level clustering, since the spot was even smaller than the cell size, with no need for further segmentation. Consequently, the K-means clusters on high-resolution spots are highly mixed together (Fig. 4j left), which heavily deviate from the truth (Fig. 4c right) and are even worse than K-means on low-resolution spots (Fig. 4c left). Consistently, the cell type deconvolution of CAGEs revealed that different cell types were mixed in all K-means clusters and that one cell type was lost (Fig. 4k left & Supplementary Fig. 38 ). These facts posed an even more difficult challenge for the K-means on the high-resolution spot. Compared to the K-means, SpaGCN performed much better clustering on high-resolution spots (Fig. 4j middle), which accurately found the region of CA1, CA2, CA3, and DG. The cell type deconvolution of CAGEs also recovers those four cell types despite slightly mixing with others (Fig. 4k middle), but two clusters were found for Glia with CA2 regions missed. However, as opposed to K-means and SpaGCN, STIE recovered all 5 clusters accurately, and all CAGEs were composed of single-cell clusters at a very high proportion (Fig. 4j–k ). Specifically, the single cell clusters from the high-resolution spots are highly concordant with those from the low-resolution spots (Fig. 4l ). A similar observation was also made for the simulated high-resolution spot data on human breast cancer (Fig. 4m–o & Supplementary Fig. 39 ). We also tested MUSE on the simulated high-resolution spatial transcriptomics data, which automatically determined 50 clusters for the mouse brain hippocampus and 159 clusters for the human breast cancer (Supplementary Fig. 40 ). The clusters are highly mixed and difficult to compare with the ground truth. Together, STIE can perform single-cell clustering for both low- and high-resolution spot spatial transcriptomics data, which outperforms spot-/subspot-level clustering in terms of both resolution and associated CAGEs. STIE reveals single-cell level insights in spatial transcriptomics As discussed above, differing from the single-cell gene expression profiling, the spot cannot always capture the complete cells, resulting in loss of information from cells at the spot boundary or contamination of cells from outside. Therefore, the first question is raised: does the gene expression captured by the spot derive from all cells located in the spot, partially from cells at the center of the spot, or even more cells outside the spot? Since STIE deconvolutes whole-slide-wide single cells with both cell typing and spatial information retained, we can check different areas surrounding the spot center and their covering cells. The area that shows the best concordance between the spot gene expression and its covering cell typing can provide the answer to the actual area captured by the spot. We referred to the size of that area as the “bona fide spot size” to distinguish it from the reported spot size. Accordingly, we fit the STIE model by varying the bona fide spot sizes with the spot center fixed and checked the model fitting between gene expression and cell type deconvolution. In the mouse brain hippocampus 10X Visium spatial transcriptome, the area in 2x-3x spot diameter demonstrated the best model optimization with the least root-mean-square error (RMSE) (Fig. 5a ). The analysis was repeated on the 10X Visium human breast cancer FFPE spatial transcriptome, and a consistent observation was made (Fig. 5b ). Moreover, we investigated the new released 10X Visium V2 chemistry CytAssist datasets from two mouse brain replicates (Fig. 5c ), and consistently observed the minimum RMSE around 2.5x spot diameter. Although a small variation was observed among datasets, the above facts suggest that the spot may capture even more gene expression from the spot surroundings, further motivating us to reevaluate the resolution of the spot. As such, the above simulation (Figs. 1 , 3o, p , 4j–n , 5h, i ) should have even higher spot resolution than we reported, since we simulated the bona fide spot size rather than spot size. In STIE, we introduced the hyperparameter γ to represent the bona fide spot size, and we have set γ = 2.5x reported spot diameter (55 μm) for all above analyses. Fig. 5: Relevant questions in spatial transcriptomics addressed by the single-cell level deconvolution. Identification of the bona fide area captured by spots in the mouse brain hippocampus (a) and human breast cancer (b). The x-axis represents the putative size of the bona fide area measured by the spot in the unit of a regular 10X Visium spot size (55 μm). The top panel represents the chart of bona fide spot size in the real tissue. The middle panel represents the cell count (y-axis) in the spot area (x-axis); the bottom panel represents the RMSE by fitting the STIE model using the cells within the corresponding area indicated by the x-axis. The distributions are drawn from 80 to 151 spots in mouse brain hippocampus and 1568–2508 spots in human breast cancer, respectively. c Identification of the bona fide area captured by spots in the 10X Visium V2 Chemistry CytAssist mouse brain hippocampus. The distributions are drawn from 120 to 230 spots in section 1 and 121–245 spots in section 2, respectively. The evaluation of image contribution to the cell type deconvolution in human breast cancer (d) and mouse brain hippocampus (e). The top panel represents the difference between cell-type proportions estimated from the spot gene expression and the nuclear morphological features; the bottom panel represents the RMSE of gene expression fitting. The x-axis represents the value of \(\lambda\) in Formula (7). The distributions are drawn from 151 spots in mouse brain hippocampus and 2,451 spots in human breast cancer, respectively, which are presented as mean values ±SEM. f Heatmap of the correlation between the cell type proportion within spots by SPOTlight, DWLS, Stereoscope, RCTD, Tangram, BayesPrism, and STIE. g The association between transcriptomic signature similarity and cell-type colocalization by SPOTlight, DWLS, Stereoscope, RCTD, Tangram, BayesPrism, and STIE. The two-sided p values are calculated for the Pearson’s correlation coefficients (n = 36) without adjustment for multiple comparisons. h–i High-resolution spots along with STIE holds the premise to distinguish nuanced cell types. h Random assignments of Memory Bcell or Naïve Bcell to the Bcell; CD8+ Tcell, CD4+ Tcell, NK cells, Cycling Tcell, or NKT cell, to the Tcell; and Macrophage, Monocyte, Cycling Myeloid, or DCs to the Myeloid. i The barplot represents the concordance of STIE deconvoluted/convoluted single cells with the simulation ground truth (h). The x-axis represents the simulated spot diameter, and the y-axis represents the concordance. The color refers to the cell type in the legend. Source data are provided as a Source Data file. Second, unlike the traditional cell-type deconvolution method, STIE integrates nuclear morphological information with gene expression. Therefore, we wanted to investigate whether the nuclear morphology can really contribute to the deconvolution. According to Eq. ( 7 ) (“Methods”), the parameter \(\lambda\) penalizes the difference between morphology- and transcriptome-based prediction. The extremely large \(\lambda\) renders the prediction entirely rely on nuclear morphology, which is first confirmed by the consecutively diminished difference between morphology- and transcriptome-based prediction along the increasing \(\lambda\) (Fig. 5d, e top). Therefore, we gradually attribute more to the morphology by increasing the shrinkage penalty \(\lambda\) and check the model fitting. Consequently, the RMSE slightly decreased and reached a minimum between 1e + 03 and 1e + 04 in the human breast cancer spatial transcriptomics dataset (Fig. 5e bottom), indicating the \(\lambda\) value of the best balanced contribution between transcriptome and morphology. Afterwards, the RMSE sharply increased with the extremely large \(\lambda\) value, suggesting that the optimized model cannot fully reply on the nuclear morphology. It is worth noting that the nuclear morphology has been incorporated into the STIE model even at \(\lambda\)=0 (Eq. 9 ) when determining the single cell type. In 10X Visium and V2 Chemistry CytAssist mouse brains (Fig. 5d bottom & Supplementary Fig. 41 ), we observed all flat RMSE at the beginning followed by the elevation at the large \(\lambda\), suggesting the best model has been achieved at \(\lambda\)=0. The difference between STIE deconvolution at \(\lambda\)=0 and other methods relying on only gene expression (Fig. 3a–h ) supports the additional contribution of nuclear morphology to the accurate cell typing. The similar difference can also be observed from the comparison between STIE clustering at \(\lambda\)=0 and K-means clustering on the CytAsssist mouse brain cortex (Supplementary Fig. 27 ). These facts corroborated the coordinated contribution of gene expression and nuclear morphology to cell typing in the spatial transcriptome. Third, we wanted to investigate cell type deconvolution-based cell type colocalization. The cell type proportion can be estimated within each spot given the gene expression. Subsequently, the correlation between the predicted cell-type proportions has been widely used to measure cell type colocalization as a clue of cell type interaction 27 , 37 , 38 . Here, we compared STIE with the other six methods that take gene expression as input and deconvolute cell types at spot level. Similar to Stereoscope, we calculated for each method the Pearson’s correlation (n = 2518) between cell type proportion within spots as a measure of cell type colocalization (Fig. 5f ). Overall, the transcriptomic signature-based methods suggested more cell type colocalization with more saturated and pronounced values, whereas STIE was more conservative (Fig. 5f , Supplementary Fig. 42 ). Specifically, all methods found a negative correlation between CAF and CancerEpithelial, indicating that CAF spatially dissociates from the tumor cell area, which is consistent with the gross histological distributions of fibrous tissue area and invasive carcinoma foci in the original H&E image (Fig. 3n ). In addition, a negative correlation between immune cells and CancerEpithelial was observed by all seven methods, suggesting less immune cell infiltration into tumor cell areas, which can be further related to the efficacy of immuno-oncology therapy. However, by checking the cell type transcriptomic signatures, we found that signature similarity was significantly correlated with cell type colocalization by all methods except STIE (Fig. 5g ), revealing a potential co-linearity bias within transcriptomic signatures for the methods relying only on gene expression. For instance, the original tissue slide is limited to the cancer area and may not contain “normal epithelia” histologically. However, due to the high correlation of the signatures between CancerEpithelial and NormalEpithelial (Pearson’s correlation coefficient r = 0.61 with n = 1071 and p = 5.1e–130), DWLS, SPOTlight, and RCTD all suggested their high colocalization. In contrast, STIE identified very few NormalEpithelial cells by utilizing the nuclear shape to further distinguish them from the CancerEpithelial, even though they are largely similar in the transcriptome. Therefore, STIE suggests no colocalization between NormalEpithelial and CancerEpithelial in this dataset. In addition, the misidentification of colocalization may also result from the lack of sparsity in statistical inference or failure to characterize cell counts by gene expression-based methods. For example, the cell type with an extremely small proportion is still kept in one spot, and the high proportions of cell types are still estimated for those cell-sparse areas (Fig. 3a–h ). These facts implied the benefits of accurate cell typing at single-cell level via exploiting both gene expression and nuclear morphological information, which facilitates less biased downstream exploration. At last, it would be very useful for digital pathology if STIE could distinguish more nuanced cell types, for instance, the heterogeneous T and B cells that are morphologically similar but transcriptomically different. Intuitively, the current low-resolution spot covers multiple whole cells simultaneously, making it almost infeasible to distinguish two nuanced T or B cells covered by the same spot. However, this could be potentially addressed by the upcoming high-resolution spot-based spatial transcriptomics, which is corroborated by the following simulation analysis. Based on the STIE-deconvoluted Tcell, Bcell, and Myeloid from the human breast cancer low-resolution spatial transcriptomics, we randomly assigned them the new nuanced immune cell type: Memory Bcell or Naïve Bcell to the Bcell; CD8+ Tcell, CD4+ Tcell, NK cells, Cycling Tcell, or NKT cell, to the Tcell; and Macrophage, Monocyte, Cycling Myeloid, or DCs to the Myeloid. The randomly assigned cell types were used as ground truth in the following investigation (Fig. 5h ). We used the scRNA-seq data 31 to rebuild a new highly-heterogeneous cell-type transcriptomic signature, and further, simulated spatial transcriptomics datasets with spot diameters of 55 μm, 30 μm, 20 μm, 10 μm, and 5 μm, respectively, for which the gene expression of each spot was simulated based on the fraction of its covering cell types and the corresponding high-heterogeneous cell type transcriptomic signatures. Of note, the spot size and gene expression are different between simulated datasets, while the pathology image and nuclear morphology remain the same. Using the high-heterogeneity cell type signature, we run STIE on each simulation to investigate if we can recover those nuanced cell types. The accuracy of low-heterogeneity cell types, including CAFs, Plasmablasts, PVL and Cancer Epithelial, is consistently high with small variation across spot sizes (Fig. 5i ). As opposed, the accuracy of high-heterogeneity immune cells demonstrated a high variation, and with the spot size becoming smaller, the accuracy is largely improved. At 5 μm, the nuanced immune cells are convoluted very accurately. The high-resolution spot still covers multiple cells partially, but it is less likely to completely cover two cells compared to the low-resolution spot, so that the cell type could be determined by the spot that captures the largest fraction of the cell and gives the highest probability. Thus, the high-resolution spot along with STIE single-cell convolution provides a higher power in distinguishing nuanced cell types. STIE enables the investigation of spatially resolved cell-cell interactions Given the STIE-obtained single cells with spatial information retained, we further investigated the spatially resolved cell-cell interaction in the tissue. We use STIE to re-estimate the cell-type whole-transcriptomic signature from human breast cancer spatial transcriptomics and take it as input for CellChat 39 , a toolkit and a database of interactions among ligands, receptors, and their cofactors, to quantitatively infer the cell-cell interaction. We investigated the interactions at \(\lambda\)=1e3 and \(\lambda\)=1e4 for STIE, respectively, given their similar measurements in the gene expression and nuclear morphology fittings (Supplementary Fig. 10 ). By further aggregating the ligand-receptor into pathways, we found that, in both settings, the interactions of extracellular matrix (ECM) pathways, such as COLLAGEN and FN1, predominantly occur between CAF, CancerEpithelial, and PVL, as well as the immune pathways, such as SPP1 and MIF, between immune cells, CAF and CancerEpithelial (Fig. 6a & Supplementary Fig. 43 ). We spatially mapped the interaction strength to the cell pairs under a cutoff of cell-cell distance that may permit the potential cell-cell interaction. The COLLAGEN pathway was taken as an illustrative example (Fig. 6b ). The outgoing signal is sent mainly from CAFs with a widespread distribution on the tissue (Fig. 6b left), while the incoming signal is received locally by the CancerEpithelial at the tumor region and the PVL at the tumor boundary, confirming that the spatially resolved interaction is driven not only by the ligand-receptor interaction strength but also by the cell abundance and spatial location. Accordingly, we incorporated the information of the neighboring cell pairs into the cell-cell interaction strength (Fig. 6c left) and obtained the global spatial cell-cell interaction strength (Fig. 6c right), which further suggests that spatial intercellular interactions primarily occur among CAFs and CancerEpithelial cells in this tissue. Fig. 6: Cell-cell communication with spatial information retained. a–f The spatially resolved cell-cell interaction in human breast cancer at \(\lambda\)=1e3 for STIE. a Heatmap of the cell-cell interaction strength of pathways in sender cell types (left) and receiver cell types (right). In sender and receiver heatmaps, the bar plots on top and right represent the total amount of cell-cell interaction within the cell type and pathway, respectively. b Cell-cell interaction in the COLLAGEN pathway. The color of the edge is indicated by the sender cell type on the left panel and the receiver cell type on the right panel, respectively, and the thickness is proportional to the interaction strength. The middle panel is the combined sending and receiving signals with nuclear segmentation and spots overlaid (The black color of edges and circles does not represent cell types). c The non-spatially resolved interaction strength between cell types (left) and the total amount of spatially resolved cell-cell interaction strength (right), which is the strength of interaction (left) times the number of cell pairs at proximity. The size of the dot is proportional to the number of cell types, and the edge thickness is proportional to the interaction strength (left) and the amount of interaction (right). d Heatmap of the contribution of pathways (top) and cell types (bottom) to the pattern of senders (left) and receivers (right). The enriched pathways in each pattern further overlapped with the top 30 strongest pathways (in Fig. 6a) and are listed beside the heatmap. e The sender and receiver cell-cell interaction patterns across the tissue. The color of the edge is indicated by the sender cell type and the receiver cell type in the sender and receiver patterns, respectively. f Cell-cell interactions in the FN1, SPP1, MIF and CXCL pathways. g–i Spatial cell-cell interactions in the mouse brain hippocampus. g Heatmap of the cell-cell interaction strength of pathways in sender cell types (left) and receiver cell types (right). h The non-spatially resolved interaction strength between cell types (top) and the total amount of spatially resolved cell-cell interactions (bottom). i Heatmap of the contribution of pathways (top) and cell types (bottom) to the pattern of senders (left) and receivers (right). In all above analyses, only cell-cell interactions within 3μm were counted to consider the potential autocrine and paracrine signaling. Source data are provided as a Source Data file. 10X Visium ST data preprocessing The raw reads and images of 10X ST Visium FFPE for mouse brain, mouse kidney and human breast cancer and the processed gene-level read count matrix and image of 10X CytAssist Spatial Gene Expression (V2 Chemistry) for mouse brain section 1&2 were downloaded from www.10xgenomics.com/resources . The raw data were preprocessed using Spaceranger ver. 1.3.0 with parameters in the downloaded “web_summary.html”. We used the automatic tissue detection by SpaceRanger workflow to filter out the spots located outside tissue areas based on the histological examination. Further, considering that the contamination of background RNA other than cellular RNA may cause automatic tissue detection to falsely identify spots outside the tissue, we also performed manual annotation, to exclude the location that do not overlap tissues. For the specific area of tissue, e.g., the mouse brain cortex and mouse brain hippocampus, we used Loupe Browser ver. 5.1.0 Visium Manual Alignment to manually annotate and select the area of interest and generate the.json file for the following analysis. The spaceranger ver. 1.3.0 count command was run with the parameter “--loupe-alignment” set to be the.json file. The processed gene-level read count matrix, histology image and nuclear segmentation of 10X Visium frozen human dorsolateral prefrontal cortex were downloaded from http://research.libd.org/globus . The nuclear morphological features were extracted using ImageJ. Image-based nucleus segmentation using deep learning The ImageJ plugin DeeplmageJ 21 with the “Multi-Organ Nucleus Segmentation” model 22 , 48 , 49 , 50 was used to perform nucleus segmentation. The Multi-Organ Nucleus Segmentation model was proposed by the “Multi-Organ Nucleus Segmentation Challenge 22 ”, aiming to train the generalized nucleus segmentation model in H&E stained tissue images to address the challenge that nucleus segmentation algorithms working well on one dataset can still perform poorly on other datasets with variation resulting from different organs, disease conditions, and even digital scanner brands or histology technicians. The cell location and morphology features were extracted for every single cell using ImageJ ROI Manager. To reduce memory consumption, the large H&E images were split into small images of 3000 × 3000 pixels. To avoid cells being split on the image boundary, each split image has four 100-pixel margins overlapping with its adjacent images. The overlapped cells were checked from the four 100-pixel margins based on their location distance and cell diameter. The larger cell among overlapped cells was kept for the following analysis. The ImageJ macro was incorporated into the STIE R package to run the ImageJ plugin for each split image. Construction of cell-type transcriptomic signature from the scRNA-seq data Given the single-cell data set, we used the buildSignatureMatrixMAST() R function developed by DWLS to select the marker genes and build the matrix of cell-type transcriptomic signature. The cell types in the single-cell data are expected to represent all cell types in the spatial transcriptomics. The clusters of the single cells that reveal the constituent cell types are required as input. Upon characterization of the cell types, differential expression analysis is performed to identify marker genes for each cell type. We define marker genes

Glia Frequently Asked Questions (FAQ)

  • When was Glia founded?

    Glia was founded in 2012.

  • Where is Glia's headquarters?

    Glia's headquarters is located at 30 West 21st Street, New York.

  • What is Glia's latest funding round?

    Glia's latest funding round is Unattributed.

  • How much did Glia raise?

    Glia raised a total of $167.35M.

  • Who are the investors of Glia?

    Investors of Glia include Wildcat Capital Management, Insight Partners, RingCentral Ventures, Plug and Play Fintech Accelerator, Entrepreneurs Roundtable Accelerator and 5 more.

  • Who are Glia's competitors?

    Competitors of Glia include Posh, Digital F2F, Agent IQ, Kasisto, Eltropy and 7 more.

Loading...

Compare Glia to Competitors

Clinc Logo
Clinc

Clinc specializes in conversational AI technology for the banking sector, providing solutions that enhance customer experiences. The company offers virtual assistants that understand and process natural language, enabling customers to perform banking transactions and receive personalized support without human intervention. Clinc's AI solutions are designed to reduce operational costs, improve customer satisfaction, and decrease reliance on live agents. It was founded in 2017 and is based in Ann Arbor, Michigan.

PolyAI Logo
PolyAI

PolyAI specializes in the development of artificial intelligence for various business sectors. The company offers enterprise conversational assistants that enable natural communication between businesses and their customers. It primarily serves the customer service sector. The company was founded in 2017 and is based in London, United Kingdom.

Posh Logo
Posh

Posh is a company specializing in conversational AI solutions for the banking sector. It offers products such as voice and digital assistants to automate customer service and enhance employee efficiency, as well as a knowledge assistant for information management. Posh's AI platform is designed to integrate seamlessly with existing banking processes, providing secure and efficient AI management. It was founded in 2018 and is based in Boston, Massachusetts.

B
Blinkin

Blinkin specializes in providing AI-assisted support solutions across various industries. The company offers a suite of companion experiences that facilitate product registration, setup, maintenance, troubleshooting, and exploration through multimodal AI, including visual guidance and AR video calls. Blinkin's solutions cater to the needs of enterprise creators and end-users, aiming to enhance self-service capabilities and streamline customer support. It was founded in 2020 and is based in Munich, Germany.

T
Talk Bank

Talk Bank is a neo-bank that reduces costs for customer support costs by AI chat-bot in messengers, providing white-label banking solutions for banks, insurance companies, and marketplaces. Talk Bank platform includes advanced remote KYC, behavioral anti-fraud solutions, service for freelancers and self-employed, transparent transaction control.

Kasisto Logo
Kasisto

Kasisto is a company that focuses on conversational artificial intelligence solutions, primarily in the banking and finance sector. The company offers a platform that provides intelligent digital assistants, designed to facilitate accurate, human-like conversations and empower teams with financial knowledge. Kasisto primarily serves global financial institutions, regional banks, community banks, and credit unions. It was founded in 2013 and is based in New York, New York.

Loading...

CBI websites generally use certain cookies to enable better interactions with our sites and services. Use of these cookies, which may be stored on your device, permits us to improve and customize your experience. You can read more about your cookie choices at our privacy policy here. By continuing to use this site you are consenting to these choices.