Robeco, The Investments Engineers
blue circle

23-09-2025 · Visión

Using artificial intelligence for selecting small-cap stocks

Amid an ever-changing economic environment, small-cap equities present an equity sub-asset class that historically attracted flows in global large-cap market environments. In this insight of our special topic as part of our 5-year Expected Returns, our quant researchers Tobias Hoogteijling, Vera Roersma and Matthias Hanauer talk about how small caps are coming back into the spotlight.

    Autores/Autoras

  • Tobias Hoogteijling - Researcher

    Tobias Hoogteijling

    Researcher

  • Vera Roersma - Researcher

    Vera Roersma

    Researcher

  • Matthias Hanauer - Researcher

    Matthias Hanauer

    Researcher

The small-cap market

Small-cap stocks are publicly traded companies with market capitalizations typically ranging between USD 250 million and USD 10 billion.1 Small caps are an integral part of the global equity market portfolio, and many consider explicit allocation to them. “This is especially the case in today’s markets, where mega-cap tech stocks dominate large-cap indices, which also trade at elevated valuations,” says Hanauer. In contrast, small-cap indices comprise a much wider and more diversified mix of stocks, trading at a discount of approximately 20% compared to large caps.

The small-cap universe presents attractive alpha opportunities for investors who know where to find them, but also sizable challenges due to its broad and heterogeneous nature. Human analysts can only cover a limited number of names, and the more niche nature of small-cap firms makes specialization more difficult. 2

Sparse coverage means alpha opportunities

For example, on average, each small-cap stock is currently covered by about eight analysts, compared to 19 for large-cap stocks.3 Although fundamental managers might struggle to cover the investable small-cap universe, it fits naturally into a quantitative approach, as the cost of developing a quantitative model does not increase with the investable universe’s size. Conversely, as Roersma points out, quantitative strategies actually benefit from the universe’s breadth, from which they can systematically exploit a broad set of opportunities.

The small-cap universe’s size, diversity and complexity make it difficult for analysts and fundamental investors to cover it thoroughly. Its resulting market inefficiency and mispricing, however, can create larger opportunities to generate alpha. This greater degree of mispricing can be observed empirically, for example, by analyzing hypothetical top-minus-bottom portfolios based on well-known stock market factors such as value, momentum, and quality. Figure 1 illustrates that the annualized factor premiums and their corresponding volatilities are higher in the small-cap space than in the large-cap space.

Figure 1: Top-minus-bottom factor premiums in small-cap and large-cap space

Figure 1: Top-minus-bottom factor premiums in small-cap and large-cap space

The performance shown is hypothetical, based on theoretical models and assumptions, and is no guarantee for future results. Source: Robeco. The graph reports the annualized returns and volatilities for ‘small’ and ‘large’ cap factors. Factor returns are the return spreads between the top and bottom factor quintile portfolios. Portfolios are equal-weighted and rebalanced monthly. The investment universe consists of the constituents of the MSCI World (‘large’) and MSCI World Small Cap (‘small’) indices. The generic factors are Value (book/price, forward earnings/price), Quality (ROE, debt/assets), and Momentum (12–1 month price momentum). The composite factor is an equally weighted average of Value, Momentum, and Quality. The sample period spans from January 2000 to April 2025.

Predicting stock returns with machine learning

To model such stock market dynamics, we turn to machine learning (ML), a subcategory of AI. In this specific case, says Hoogteijling, we are interested in modeling which stocks are likely to yield high returns (and thus are potential buy candidates) and vice versa. One key advantage of ML techniques is their flexibility in capturing data patterns. In a simpler, traditional approach, one would often assume that relationships are linear. However, not all real-life relationships can be linearly approximated. ML algorithms such as random forests and neural networks4 do not impose such a strict functional form, but instead can be more freely shaped depending on the data given to the model. 5

Descubra el valor de la inversión cuantitativa

Suscríbase para conocer las últimas novedades y estrategias sobre inversión cuantitativa.

Explore la inversión cuantitativa

Performance of stock selection models

To assess the performance of these ML models – in other words, whether they can accurately predict whether stocks will outperform or underperform – we conduct a historical simulation study classifying the top and bottom half of stocks’ predicted returns. Then, we investigate to what extent the various models can correctly predict these two halves. If a model had no predictive power, about half of the stocks would be correctly predicted.

As illustrated in Figure 2, we find that both a generic factor model and an ML model outperform the ‘no predictability’ benchmark in terms of model accuracy. Furthermore, the ML model outperforms the generic value-quality-momentum factor model. Although stock-specific predictability of around 51-52% might seem low, small but consistent advantages can translate into long-term success. These stock-specific hit ratios translate into portfolio hit ratios of 68% and 78% for long-short quintile portfolios based on monthly returns, when sorted by generic and ML model predictions, respectively. These numbers highlight how even a slight improvement in predictive accuracy can lead to substantially better portfolio outcomes. 6

Figure 2: Average percentage of stocks accurately classified

Figure 2: Average percentage of stocks accurately classified

The performance shown is hypothetical, based on theoretical models and assumptions, and is no guarantee for future results. Source: Robeco. The figure on the left shows the time series’ average percentage of stocks correctly classified as being in the top half or bottom half of stocks in terms of next month returns, in a historical simulation study. The figure on the right shows the hit ratio of monthly long-short quantile portfolio returns, sorted based on predicted returns. The considered universes are the constituents of the MSCI World Small Cap Index over the period December 2000 to April 2025. The generic factor model consists of generic Value (book/price and forward earnings/price), Quality (ROE and debt/assets) and Momentum (12-1 month price momentum) factors.

Conclusion

In recent years, AI has shifted from hype to practical application, with machine learning in particular showing real promise for quantitative investing. Small-cap equities, with their breadth, inefficiencies, and limited coverage, offer especially fertile ground for these techniques.

Yet, as Hoogteijling explains, AI is not a silver bullet: success depends on careful model selection, interpretation, and explainability. When used with skill and discipline, however, machine learning can be a powerful addition to the quant investor’s toolkit, and a source of enduring edge in small-cap markets.

This article is an excerpt of a special topic in our five-year outlook.

Footnotes

1 As of the end of April 2025, based on the market capitalization numbers for the constituents of the MSCI World Small Cap Index, winsorized at the top and bottom 2.5%.
2 As of the end of April 2025, based on four bottom-up-calculated multiples (price-to-book, forward price-to-earnings, price-to-earnings, and price-to-dividend) for the MSCI World and MSCI World Small Cap indices.
3 Source: IBES.
4 A neural network is a type of ML model that learns to recognize patterns, inspired by how the human brain processes information. A random forest is a group of simple decision-making models (called decision trees) that work together to make more accurate choices than just one tree on its own.
5 For a broader view of the different applications of ML in asset management, we refer to Blitz et al. (2023) and Chen and Zhou (2023).
6 The performance shown is hypothetical, based on theoretical models and assumptions, and is no guarantee for future results.

Mantengamos la conversación

Manténgase al día de los constantes cambios en inversión sostenible y factorial, tendencias y crédito.

No se lo pierda
Robeco

El objetivo de Robeco es proporcionar a sus clientes unos rendimientos y soluciones de inversión superiores para que consigan sus objetivos financieros y de sostenibilidad.

Información importante
Los Fondos Robeco Capital Growth no han sido inscritos conforme a la Ley de sociedades de inversión de Estados Unidos (United States Investment Company Act) de 1940, en su versión en vigor, ni conforme a la Ley de valores de Estados Unidos (United States Securities Act) de 1933, en su versión en vigor. Ninguna de las acciones puede ser ofrecida o vendida, directa o indirectamente, en los Estados Unidos ni a ninguna Persona estadounidense en el sentido de la Regulation S promulgada en virtud de la Ley de Valores de 1933, en su versión en vigor (en lo sucesivo, la “Ley de Valores”)). Asimismo, Robeco Institutional Asset Management B.V. (Robeco) no presta servicios de asesoramiento de inversión, ni da a entender que puede ofrecer este tipo de servicios, en los Estados Unidos ni a ninguna Persona estadounidense (en el sentido de la Regulation S promulgada en virtud de la Ley de Valores). Este sitio Web está únicamente destinado a su uso por Personas no estadounidenses fuera de Estados Unidos (en el sentido de la Regulation S promulgada en virtud de la Ley de Valores) que sean inversores profesionales o fiduciarios profesionales que representen a dichos inversores que no sean Personas estadounidenses. Al hacer clic en el botón “Acepto” que se encuentra en el aviso sobre descargo de responsabilidad de nuestro sitio Web y acceder a la información que se encuentra en dicho sitio, incluidos sus subdominios, usted confirma y acepta lo siguiente: (i) que ha leído, comprendido y aceptado el presente aviso legal, (ii) que se ha informado de las restricciones legales aplicables y que, al acceder a la información contenida en este sitio Web, manifiesta que no infringe, ni provocará que Robeco o alguna de sus entidades o emisores vinculados infrinjan, ninguna ley aplicable, por lo que usted está legalmente autorizado a acceder a dicha información, en su propio nombre y en representación de sus clientes de asesoramiento de inversión, en su caso, (iii) que usted comprende y acepta que determinada información contenida en el presente documento se refiere a valores que no han sido inscritos en virtud de la Ley de Valores, y que solo pueden venderse u ofrecerse fuera de Estados Unidos y únicamente por cuenta o en beneficio de Personas no estadounidenses (en el sentido de la Regulation S promulgada en virtud de la Ley de Valores), (iv) que usted es, o actúa como asesor de inversión discrecional en representación de, una Persona no estadounidense (en el sentido de la Regulation S promulgada en virtud de la Ley de Valores) situada fuera de los Estados Unidos y (v) que usted es, o actúa como asesor de inversión discrecional en representación de, un inversión profesional no minorista.


El acceso a este sitio Web ha sido limitado, de manera que no constituya intento de venta dirigida (según se define este concepto en la Regulation S promulgada en virtud de la Ley de Valores) en Estados Unidos, y que no pueda entenderse que a través del mismo Robeco dé a entender al público estadounidense en general que ofrece servicios de asesoramiento de inversión. Nada de lo aquí señalado constituye una oferta de venta de valores o la promoción de una oferta de compra de valores en ninguna jurisdicción. Nos reservamos el derecho a denegar acceso a cualquier visitante, incluidos, a título únicamente ilustrativo, aquellos visitantes con direcciones IP ubicadas en Estados Unidos. Este sitio Web ha sido cuidadosamente elaborado por Robeco. La información de esta publicación proviene de fuentes que son consideradas fiables. Robeco no es responsable de la exactitud o de la exhaustividad de los hechos, opiniones, expectativas y resultados referidos en la misma. Aunque en la elaboración de este sitio Web se ha extremado la precaución, no aceptamos responsabilidad alguna por los daños de ningún tipo que se deriven de una información incorrecta o incompleta. El presente sitio Web podrá sufrir cambios sin previo aviso. El valor de las inversiones puede fluctuar. Rendimientos anteriores no son garantía de resultados futuros. Si la divisa en que se expresa el rendimiento pasado difiere de la divisa del país en que usted reside, tenga en cuenta que el rendimiento mostrado podría aumentar o disminuir al convertirlo a su divisa local debido a las fluctuaciones de los tipos de cambio. Para inversores profesionales únicamente. Prohibida su comunicación al público en general.

Advertencia - Uso fraudulento de Robeco en sitios web y redes sociales Más información