ChatGPT: Hedge Fund’s New Edge

Henry Booth

ChatGPT: Hedge Fund’s New Edge

Artificial Intelligence (AI) is revolutionising industries, from healthcare to automotive. ChatGPT, a standout in AI, is known for its task automation and human-like text generation.


In finance, hedge funds are particularly keen on leveraging this technology. This article delves into how hedge funds are adopting and adapting ChatGPT to gain a competitive edge while highlighting the more cautious approach banks take.


The Hedge Fund Landscape

Adoption Rates

The finance sector has always been a hotbed for innovation, and hedge funds are no exception. According to a recent survey conducted by BNP Paribas' Capital Introduction team, 44% of money managers use ChatGPT professionally, indicating a significant shift towards AI adoption in the sector.


Another 10% are considering its use, showing strong interest within the community. The respondents, hailing from firms with a combined AUM of $250.5 billion, are primarily based in America, followed by EMEA. Interestingly, most managers using ChatGPT come from fundamental firms, while quant firms prefer their machine-learning programs. 


This high adoption rate among hedge funds is not merely a trend but a reflection of the sector's constant pursuit of efficiency and competitive advantage. While the BNP Paribas survey provides a snapshot, it's worth noting that the landscape is dynamic. Hedge funds are not just adopting ChatGPT; they are actively experimenting with it to find new use cases that can add value to their operations. 


Use Cases

ChatGPT adoption in hedge funds is about solving real-world challenges and enhancing various aspects of the business.


According to the BNP Paribas survey, those using ChatGPT professionally use it in many areas: 

  • Document Summarisation: 36% use it to summarise documents such as regulatory findings, broker reports, and academic papers.
  • Marketing: 35% employ ChatGPT in their marketing efforts.
  • Coding: 6% use it for coding tasks.
  • Email Drafting: 6% utilise it for drafting emails.
  • Legal Analysis: 6% use it for preliminary analysis of legal documents.


The 6% for coding may seem low, but it's worth noting that most respondents were from fundamentally driven hedge funds.


Leading hedge funds like Man Group and Citadel are at the forefront of this technological revolution. Man Group employs ChatGPT to review stacks of academic papers and for the preliminary analysis of data sets. Bloomberg reported Citadel was negotiating an enterprise-wide license for ChatGPT, seeing potential in tasks like translating code between languages. Another quant fund, Campbell & Co, uses ChatGPT to summarise internal research and write boilerplate code, demonstrating the technology's versatility.


Coding 

Coding is an obvious use case. ChatGPT has already had and will continue to affect coding significantly. Ignoring this trend would be a mistake.


Over 40% of the code on Github is already AI-generated.  In a controlled experiment, a group of coders using GitHub Copilot completed tasks 55% faster than those without. Copilot is a specialised version of GPT-3 trained on gigabytes of software code to autocomplete instructions, generate entire functions, and automate other parts of writing source code. 


Code Generation:
ChatGPT excels in generating initial code drafts, serving as a valuable assistant to skilled programmers. While it can't yet produce fully functional code for a non-specialist, it's a powerful tool for getting the first draft down.


Bug Identification:
One of the most time-consuming tasks for developers is debugging. ChatGPT can swiftly identify simple bugs like extra spaces or missing semicolons, freeing developers to tackle more complex, structural issues.


Error Understanding:
Developers often venture outside their areas of expertise, whether it's a new programming language, hardware or an unfamiliar API. ChatGPT can demystify errors, offering guidance on resolving them without needing external help, thus streamlining the development process.


Documentation
: Inadequate documentation can be a significant bottleneck, especially when developers are in a flow state. ChatGPT can auto-generate documentation as code is written, mitigating future headaches and ensuring smoother handoffs between team members.


Collaborative Intelligence:
One super-size fund is taking ChatGPT's capabilities further. They're using it to make intra-team recommendations, effectively learning from each coder's successes and challenges to offer timely advice to others facing similar issues.



Well-Being

Interestingly, a side effect of Copilot, as reported in the survey, was the effect on coders' well-being. According to the study, 60–75% of developers reported a heightened sense of job satisfaction, reduced levels of frustration, and an increased ability to concentrate on tasks that they find genuinely fulfilling when using GitHub Copilot.


Another intriguing facet of GitHub Copilot's impact is its role in mental energy conservation—a critical factor in a developer's daily grind. The research indicates that 73% of developers found it easier to maintain their workflow, commonly called 'staying in the flow,' when using this AI-powered tool. Even more striking is that 87% of developers noted that GitHub Copilot significantly reduced mental exertion during monotonous, repetitive coding tasks.


These are encouraging results in an era where well-being and work/life balance are at the forefront of most minds. 


Automating Routine Tasks

As reported by Bloomberg, hedge funds like Man Group and Citadel are looking to use ChatGPT to handle routine and mundane tasks, the "grunt work". This could include data scraping, preliminary data analysis, and even initial stages of research. By automating these tasks, hedge funds free portfolio managers and researchers to focus on more strategic activities.


Marketing

According to a BNP Paribas survey, 70% of hedge fund managers who have adopted ChatGPT use it for marketing purposes. They're leveraging the technology to generate persuasive text for investor presentations, newsletters, and social media campaigns. This becomes particularly advantageous for small to mid-sized funds that may not have the luxury of a dedicated marketing or research team, effectively levelling the playing field with their larger counterparts.


Sentiment Analysis

Sentiment analysis is another promising use case. ChatGPT can process vast amounts of news articles, social media posts, and financial reports to gauge market sentiment. Then, trade on market sentiment or news events. ChatGPT is much better than previous natural language processing models (NLP). 


A recent research paper proves “
that GPT models deliver a considerable improvement in classification performance over other commonly used methods. We then demonstrate how the GPT-4 model can explain its classifications that are on par with human reasoning."


MAN AHL recently backed this up, publishing results where ChatGPT outperformed sentiment-based word counting. The article also points out a significant cause for difference. Classic sentiment models are trained under supervision to label words. ChatGPT shifts to a generative model using deep learning neural networks, allowing a deeper understanding of the text. It is better able to appreciate other words in the sentence and the context of a sentence to glean sentiment better. 


However, a limitation is that ChatGPT is trained on a broad range of internet data. Its performance could be improved if trained in a specific niche manner—for example, training on the fed meetings or considering only specific financial news. Additionally, the AHL article points to the importance of prompts, as the prompts written affected the outcome. 


Although… if everyone starts using ChatGPT for sentiment analysis, does that mean the alpha will be arbitraged away? Could sentiment analysis become a crowded trade like index rebalance?


Risk Assessment

ChatGPT can be programmed to monitor multiple data sources continuously for potential market risks and opportunities. It can analyse market news, economic indicators, and social media sentiment to provide real-time alerts. This enables portfolio managers to make informed decisions quickly, a crucial advantage in volatile markets.


Back-Testing Strategies

ChatGPT can also be invaluable in the back-testing phase of strategy development. Automating the back-test coding can significantly speed up the validation process for new trading algorithms. This allows quants to iterate through potential strategies more efficiently, discarding the ineffective ones and refining the promising ones.


Strategy Creation

It cannot create an entire strategy. However, ChatGPT can assist in the initial stages of strategy creation. By analysing vast datasets, it can identify potential patterns or anomalies that human analysts might overlook. This can serve as the foundation for new trading strategies, which can be further refined and back-tested.



By embracing ChatGPT, hedge funds are not just streamlining their operations but are also opening up new avenues for innovation and efficiency. As the technology evolves, we'll see even more creative and impactful use cases emerge in this sector.


The Broader Context


The adoption of ChatGPT in hedge funds is part of a larger technological wave sweeping the financial sector. While hedge funds quickly experiment with ChatGPT, banks like Goldman Sachs and JPMorgan are more cautious, citing regulatory issues.


However, banks aren't entirely avoiding generative AI. Goldman Sachs uses generative AI tools to assist its software developers in writing and testing code. It has initiated a "proof of concept" using generative AI to assist in coding tasks. While the bank aims to make human coders "more productive" rather than replace them, it's a sign that banks are open to controlled experimentation with this technology.


As reported by eFinancial Careers, Vacslav Glukhov, head of EMEA quant research for e-trading at JPMorgan, discusses the potential impact of ChatGPT on various roles within banks. He believes that ChatGPT will mostly automate jobs that involve commentary on figures and rehashing existing ideas. He suggests that while many jobs could be automated, roles that require human intelligence and the ability to predict unusual situations will still be crucial.


David Siegel of hedge fund Two Sigma and Marty Chavez of investment management firm Sixth Street offer a sceptical view. Siegel mentions that "AI has been having an impact for decades, this stuff isn't brand new," and that "people are reading too much into it". The top quant hedge funds have used machine learning and artificial intelligence for many years. 


Chavez adds that ChatGPT and similar technologies will never achieve the "holy grail" of predicting the stock market because their strengths lie in analysing stable datasets, unlike the stock market. Additionally, Vacslav Glukhov emphasises that while ChatGPT can handle routine tasks, it can't replace human creativity and originality.


AI has made significant strides in various sectors, but its effectiveness in predicting stock market movements remains complex and unresolved. The "Holy Grail" for financial markets is an AI that can predict stock prices more accurately than humans, a challenge that remains unmet.


Future Trends

As we've seen, the adoption of ChatGPT in hedge funds is already quite extensive, but what does the future hold? 


Based on current trends and the evolving needs of the industry, here are some directions we can expect:


As the technology matures, we can anticipate more robust compliance features that make it easier for hedge funds to navigate the regulatory landscape. This could make banks more comfortable adopting ChatGPT, as seen in their cautious approach. Glukhov is less convinced that ChatGPT will replace humans in complex risk and compliance roles. He argues that the technology is not numerically oriented enough to replace model validation quants.


The adoption of ChatGPT is poised to have far-reaching implications. Over the next 12 months, the technology could shrink workforces and disrupt the quant and coding market, lowering the bar for smaller funds to enter the space.


Human + Machine Collaboration

The conversation around ChatGPT often centres on its capabilities for task automation and efficiency. However, its more nuanced role is in augmenting human capabilities. Marco Argenti from Goldman Sachs notes that the technology's goal is to make human coders "more productive," not to replace them.


The real potential of ChatGPT may not lie solely in its standalone capabilities but in how it can be guided by human insight. Consider this: What if the key advantage is not what ChatGPT can do autonomously but what it can achieve when directed by thoughtful human questioning?


Ray Dalio's perspective from "Principles: Life and Work" is apt:
“Smart people are the ones who ask the most thoughtful questions, as opposed to thinking they have all the answers. Great questions are a much better indicator of future success than great answers.”


In this context, you don't need to be an expert coder or data scientist to extract valuable insights. If you know the right questions to ask, ChatGPT can provide the answers. This is less about automation and more about broadening the scope of who can participate in complex decision-making.


This form of human-machine collaboration could be a significant asset. It's not merely about speed or efficiency; it's about enabling more people to engage in tasks that previously required specialised skills. The edge may go to those who can ask the right questions and, with tools like ChatGPT, find the answers they need.


Expanding Use Cases

The BNP Paribas survey indicates that hedge funds are looking to expand the use of ChatGPT in areas like marketing and summarising documents. Given the technology's versatility, we can see it applied in even more innovative ways, such as advanced data analytics or predictive modelling.


Integration with Other Technologies

ChatGPT will integrate more with other AI and machine learning technologies, creating more comprehensive solutions. For example, combining ChatGPT's natural language capabilities with predictive analytics tools could offer more nuanced trading strategies.


Democratisation of Technology

As AI tools become more accessible and affordable, smaller hedge funds may adopt ChatGPT to level the playing field with larger competitors. This could be a game-changer in an industry where scale often dictates success.


Talent Dynamics

Using advanced technologies like ChatGPT could shift the talent dynamics in the industry. While the need for human expertise will never be entirely replaced, the roles and skills required may evolve, placing a higher premium on adaptability and tech-savviness.


Moreover, adopting ChatGPT and similar technologies is a strategic move to attract top talent. In an industry where the war for talent is fierce, especially among quantitative researchers and portfolio managers, cutting-edge technology can be a differentiator. It signals prospective employees that the firm is forward-thinking and open to leveraging technology for better decision-making and operational efficiency. 


Based on the GitHub survey and the massive reduction in work-related stress, it is plausible to see developers, coders, and quant move towards platforms and work environments that make their jobs easier. It will get to a point where, if you aren't running Copilot or similar, developers and quant won't join. 


Plus, it removes the need to hire more staff if you can turn your team into 10x coders! 


Conclusion

The financial sector stands at a crossroads with the advent of ChatGPT. Hedge funds, ever agile and innovative, are capitalising on this technology to sharpen their competitive edge. In contrast, banks are treading cautiously, weighed down by regulatory considerations.


To be clear, ChatGPT isn’t going to create an edge directly, a.k .a. alpha. (Unless you’re sentiment-based). It will create an edge away from pure alpha. It will give you an edge in helping you raise assets better than the competition with more persuasive marketing material. An edge by allowing coders to code quicker. Or an edge by improving staff well-being, productivity and turnover, attracting more coders. Or an edge by iterating quicker and homing in on the right solution quicker than your competitors. 


For hedge funds, the future with ChatGPT looks promising. Those who adapt and evolve with this technology stand to gain significantly, not just in operational efficiency but also in talent acquisition. In a fiercely competitive talent market, not leveraging ChatGPT could become a deal-breaker for prospective employees.


Moreover, the technology's potential to boost staff productivity and well-being could soon make it indispensable in the workplace.


As we peer into the future, one thing is unmistakable: ChatGPT and similar large language models will redefine the contours of the financial industry. Firms that successfully navigate this intricate landscape will set the pace in the coming years.


In a zero-sum game like trading, every edge counts. ChatGPT offers that edge—be it in coding, analysis, or staff productivity. If you're not already exploring this technology, you're risking more than just falling behind—you're risking obsolescence.


So, the question remains: Will your fund lead the charge in embracing ChatGPT, or will it watch from the sidelines?



References

https://www.bloomberg.com/news/articles/2023-07-31/hedge-fund-survey-reveals-how-money-managers-use--chatgpt

https://www.efinancialcareers.co.uk/news/2023/03/chat-gpt-jobs-banks

  • "Can ChatGPT Beat Word-Counting Humans?" - Man Institute

https://www.man.com/maninstitute/can-chatgpt-beat-word-counting-humans

  • "Understanding and Improving US Bank Sentiment Analysis with ChatGPT" - SSRN

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4399406

  • "GitHub Copilot is Generally Available to All Developers" - GitHub Blog

https://github.blog/2022-06-21-github-copilot-is-generally-available-to-all-developers/

  • "Research: Quantifying GitHub Copilot's Impact on Developer Productivity and Happiness" - GitHub Blog

https://github.blog/2022-09-07-research-quantifying-github-copilots-impact-on-developer-productivity-and-happiness/

  • "How to Backtest a Trading Strategy Using ChatGPT" - Quantified Strategies

https://www.quantifiedstrategies.com/how-to-backtest-a-trading-strategy-using-chatgpt/




By Henry Booth November 11, 2023
Quant Careers Unveiled
By Henry Booth November 11, 2023
Navigating the Maze: A Guide to Quant Portfolio Manager Interviews
By Henry Booth September 18, 2023
The Evolution of Statistical Arbitrage: Rise of Alternative Data and Shorter Holding Periods Quantitative trading has long relied on statistical arbitrage, which uses complex mathematical models to spot and exploit fleeting price differences between related financial assets. Over the last decade, as a headhunter specialising in this field, I've observed significant shifts in the stat-arb trading landscape, with two notable trends coming to the forefront: the growing use of alternative data and the trend towards shorter holding periods. In this LinkedIn blog post, we explore the evolution of statistical arbitrage, from its inception featuring straightforward pairs trading and mean reversion strategies primarily based on technical data to today's sophisticated methodologies. Our discussion will focus on the rising significance of alternative data sources and the transition from holding periods of one to two weeks to predominantly intraday operations in the context of stat-arb strategies. The Early Days of Statistical Arbitrage What is Statistical Arbitrage? Statistical arbitrage, commonly abbreviated as Stat Arb, is a quantitative investment approach typically utilised by hedge funds. This strategy involves using intricate mathematical models to detect trading prospects originating from market inefficiencies. It operates on the concept that when the price of interrelated securities strays from its usual correlation, there's a high probability it will revert to its average over time. Factors such as mispricing, market sentiment, or temporary supply-demand imbalances could cause this divergence based on any statistical measure like correlation or cointegration. Statistical arbitrage trading techniques primarily focus on the evaluation of technical data, including historical price and trading volume information. Quantitative models sift through a massive amount of historical data to spot patterns and relationships, primarily identifying short-term mispricings and inefficiencies, which are subsequently leveraged for profit. Traders practising statistical arbitrage adopt a long position in the undervalued security and a short one in the overvalued counterpart, betting on the convergence of prices. This technique results in a market-neutral strategy that depends less on overall market movements and more on the relative price fluctuations of the involved securities. Various forms and timelines can accommodate statistical arbitrage, from high-frequency trading (HFT), where positions are held for exceedingly short periods, to medium-frequency trading (MFT) strategies, where positions could be held for days or even weeks. The strategy is commonly used in MFT, where patterns are considered more reliable than HFT due to potential noise. The adoption of HFT, in turn, assists MFT by enabling the swift execution of trades, usually within milliseconds or microseconds. This is crucial, considering the targeted price discrepancies are typically minimal and fleeting. Statistical arbitrage, while theoretically a low-risk strategy due to its market-neutral aspect, has risks. These can emanate from model overfitting, where predictions based on historical data may not sustain in the future, and from major market shocks that can disturb statistical relationships. Crucial updates hitting the market, such as earnings reports, unique dividends, or legal proceedings, are instances of stock market fluctuations that can disrupt short-term statistical correlations. The strategy requires three critical factors: predictability, as success is unlikely without the ability to foresee price movements; volatility, since statistical arbitrage is ineffective in low volatility scenarios such as those observed in 2016 and requires price movement for success; and dispersion, dispersion in price and variance of ideas and viewpoints and dispersion of price movements. The strategy thrives on differing opinions; it needs some to believe that prices will continue their trend while others expect them to revert. Meaning price movements need to be different in relation to others. They can’t all move up or all down. Stat-arb requires one up and one down! How quick The definition of HFT or MFT and their respective typical holding periods is not set in stone. Over a decade ago, HFT was associated with intraday or quicker holding periods—seconds, minutes, and sub-second durations. In contrast, medium frequency was anything from days to weeks, typically with an average holding period of one or two weeks. This categorisation has evolved, which now includes HFT, Intraday, MFT, and Low-Frequency Trading (LFT). HFT is ultra-fast trading that is measured in seconds, milliseconds and microseconds, the majority being sub-second. Intraday is anything from a minute upwards, 15 minutes, an hour, holding up to 6/8 hours to the end of the day. Nothing overnight, as the name suggests. MFT is days to weeks but can and does include intraday. Some strategies hold for minutes and hours out to a few days. The pure traditional stat-arb average holding period is within 1 to 3 weeks. LFT is, for me, anything holding longer than a month. Capacity Speed can’t be discussed without also linking to capacity and the two are intertwined. Trading strategies differ in their capacities based on their holding periods and execution speed. Capacity in trading is the maximum volume of stocks, securities, or commodities a system can effectively handle without notably impacting the market price. This is affected by the market's liquidity, the size of a trader's orders, their risk tolerance, and capital base. HFT and intraday strategies, operating at high speeds with short holding periods, typically have constrained capacities. This is due to their immediate impact on market prices. If you were to order $1bn worth of Apple stock suddenly, the market would move against you so fast that any alpha you’d predicted would disappear with the slippage and execution costs. HFT is more light-footed, in and out quickly in small amounts. Conversely, MFT and LFT strategies have higher capacities. MFT allows larger orders to be gradually executed, reducing immediate market impact. LFT strategies, spanning a month or longer, can accommodate substantial order sizes as trades are distributed over longer periods, thus reducing market impact and increasing capacity even further. Traditional stat-arb techniques Traditional techniques of stat arb encompass a variety of strategies. For instance, pairs trading, a widely used and straightforward method during the early days of statistical arbitrage, involves identifying pairs of highly correlated assets, such as stocks of companies within the same industry, like Coca-Cola and Pepsi. Traders would closely watch these pairs, waiting for their price relationship to diverge from the historical norm. The corresponding strategy would involve purchasing the underperforming asset while short-selling the overperforming one in anticipation of their prices eventually reverting to the historical average. Mean reversion is another prevalent stat-arb strategy. It operates on the premise that price movements of financial instruments are typically mean-reverting. This implies that when prices significantly stray from their historical averages, they are likely to revert to these averages over time. Traders who apply mean reversion strategies seek assets experiencing temporary price deviations and place trades on the expectation of these prices eventually returning to their historical levels. Mean reversion can be an independent strategy or can underpin pairs trading strategies. Index arbitrage is another tactic to leverage price discrepancies between index futures and their underlying stocks. Suppose futures are priced higher or lower than the index. In that case, traders may engage in simultaneous long and short positions in the futures and the underlying stocks, profiting from the anticipated price convergence. Speed is crucial for this strategy. Exchange-Traded Fund (ETF) arbitrage revolves around exploiting the price differences between an ETF and the underlying assets it represents. Traders can create or redeem ETF shares to benefit from the price difference and earn a risk-free profit. It's worth noting that the definition of statistical arbitrage and its included strategies aren't universally standardised. Every quantitative portfolio manager contributes their unique interpretation, skills, and market perspectives. Some may adopt a two-week holding period, while others opt for just a few days. While some focus on cash US equities, others diversify their portfolio with equities and futures. This diversity is the essence of market dynamics and its zero-sum nature. The strategies above serve as a simple starting point for understanding the complexities of statistical arbitrage. Having established a foundational understanding of statistical arbitrage and its historical context, we are now poised to delve into two prominent trends I've observed: the shortening of holding periods and the ascent of alternative data. We will explore the possible causes behind these trends and discuss their potential implications in finance. Shortening of Holding Periods Stat Arb strategies have witnessed a significant change in their holding periods over the years, transitioning from a typical duration of one to two weeks to predominantly intraday to few days timeframes. A combination of factors such as heightened competition, technological advancements, and the growing demand for rapid execution has influenced this shift. With more market participants stepping into this field and alpha signal decay setting in, strategies have progressively adapted to shorter holding periods. It is rare to encounter a pure stat-arb strategy maintaining positions beyond two weeks. Most operate within one to five days, but an increasing proportion of these strategies gravitate towards intraday trading with minimal to no overnight holding. This change has sparked a convergence of styles among different trading groups. Traditionally, HFT groups like Tower and Jump mainly focused on ultra-HFT strategies such as market making and index arbitrage. Their primary edge was speed, though they undoubtedly incorporated some form of statistical arbitrage. On the other hand, quant firms like WorldQuant and Cubist typically covered horizons of one to two weeks. Over time, these distinct approaches have melded. HFT groups have ventured into intraday and short-term strategies of a few days, while medium-frequency firms have also infiltrated the shorter-term intraday domain in their search for alpha. As more participants occupy the stat-arb landscape, the alpha diminishes as it gets arbitraged away, migrating further towards shorter-term strategies. It's important to note that statistical arbitrage doesn't lend itself to longer-term holding periods. As it primarily relies on technical data like price and volume, the relevance of these factors tends to diminish as the investment horizon extends beyond a month. Beyond this timeframe, price movements become more like noise as fundamental data such as earnings reports, financial statements, and economic indicators impact prices more than statistical anomalies. It is in the MFT to LFT space where both fundamental and alternative data become more useful, over technical data. Factors driving this trend The financial sector faces the twin challenges of escalating competition and the need for accelerated trade execution. Quantitative trading, in particular, is witnessing increased rivalry, with a burgeoning number of players applying quantitative techniques to exploit mispricings and inefficiencies. This surge in competition has led to temporary price discrepancies, once available over extended periods, becoming increasingly short-lived, necessitating rapid identification and execution of trades to take advantage of fleeting opportunities. In the face of technological progress, high-frequency trading has asserted itself as a leading approach. Enhanced computing power and upgraded trading infrastructure have enabled market participants to process and analyse immense volumes of data and execute trades at unmatched velocities. HFT's capacity to pinpoint minor price differences within fractions of seconds has significantly impacted financial markets, further diminishing holding periods. This ongoing technological evolution empowers quantitative portfolio managers and researchers to develop intricate statistical arbitrage strategies characterised by increasingly shorter holding periods. Implications of shorter holding periods for quant PMs and researchers: The transition towards shorter holding periods in stat arb carries several implications for quantitative portfolio managers and researchers. Foremost, the growing emphasis on speed and real-time decision-making mandates that traders remain updated with the newest technology and retain a state-of-the-art trading infrastructure. The arms race for alpha starts with the very technology you go into it with. Secondly, as holding periods contract, research endeavours increasingly concentrate on discovering and capitalising on more detailed market patterns and inefficiencies. Techniques using Ai and deep learning are pushing pattern recognition to new heights. Finally, the need for more sophisticated risk management and execution algorithms becomes critical to successfully negotiate the challenges tied to intraday trading and mitigate the impact of transaction costs on returns. There is no point in building the world's best prediction machine if the market slips away by the time you react and execute, and your execution costs eat your alpha. The shortening of holding periods was always evident in HFT trading. Groups battled to get quicker and quicker. They'd dig 1000km ditches straighter just to shave seconds off their execution between NYC and Chicago. They pay big money to co-locate their servers next to the exchange. They went from Java to C++, then to FPGA, and even microwave technology, all to be quicker. Now, the speed race is basically won by a couple of big HFT prop firms; as the cost of entry becomes far too great, a similar game is playing out in the MFT stat arb world with shortening holding periods. However, unlike HFT, which primarily focuses on increasing speed, MFT is all about improving speed and predictive accuracy. While HFT hinges on speed and technical data, MFT leans on prediction and technical data. As competition increases in the MFT arena, it paves the way for the next significant trend - the surge of alternative data! The Rise of Alternative Data Alternative data utilisation has seen a rapid surge recently. The industry's projected expenditure hit $1.7 billion in 2020, indicating a sevenfold jump from just five years before. The proliferation of internet usage, the growth of social media, the advent of the Internet of Things, and technological advances facilitating data creation and storage are key drivers behind the exponential increase in alternative data. By the end of 2025, The World Economic Forum believes we will create 400 times more data per day than in 2012… Alternative data refers to information not readily available through conventional financial sources like financial statements, analyst reports, or market price data. These non-traditional data sources provide additional insights into market behaviour, enabling traders to identify unique trading opportunities and gain a competitive edge. Examples of alternative data sources: Social media sentiment: The advent of social media platforms like Twitter, Facebook, and Reddit has opened up a vast repository of user-generated content reflecting public sentiment towards companies, products, and market trends. By analysing social media sentiment, traders can gauge investor sentiment and anticipate potential market movements, allowing them to make more informed trading decisions. Or they ignore it and get squeezed out, like in Gamestop. Satellite imagery: Satellite imagery provides valuable information about various economic activities, such as the level of construction, traffic patterns, and even the number of cars in a retailer's parking lot. By analysing this data, traders can gain insights into a company's performance, sales, or supply chain dynamics, which can, in turn, help inform their trading strategies. Credit card transactions: Aggregated credit card transaction data offers insights into consumer spending habits, allowing traders to monitor trends and assess the health of specific companies, sectors, or the broader economy. This information can be especially valuable in predicting earnings announcements or understanding the competitive dynamics within a particular industry. Web Traffic and App Usage Data: Data from website traffic, mobile application usage, and online platforms can offer insights into consumer behaviour, brand popularity, and potential sales trends. For example, increased visits to a retailer's website or an uptick in app downloads could signal stronger-than-expected quarterly results. However, be wary of betting on page views to avoid having the next pets.com on your book! News Sentiment Analysis: Natural language processing (NLP) techniques can be used to analyse news articles and press releases to extract sentiment about a particular company or sector. Changes in sentiment can potentially be used to predict future price movements. Geolocation Data: Data from smartphones and GPS devices can reveal patterns in consumer behaviour, such as foot traffic to a retail store or visits to a particular location, which can indicate a business's popularity or potential sales. I know one strategy analysed the footfall into every Starbucks in the USA. Over the quarters, it could see in real-time whether there were more customers or fewer than the previous quarter, and so predict an earnings miss or beat. Weather Data: Weather patterns can influence consumer behaviour and impact operations in the agriculture, retail, and energy industries. For example, hot weather could boost sales for a clothing retailer or impact crop yields for a farming company. An area Citadel supposedly excels in with a team of weather scientists predicting weather patterns. Others include; E-commerce Data, Supply Chain Data, Public Records, Healthcare Data and more. Alternative Data Usage As the landscape of statistical arbitrage evolved, the focus expanded beyond traditional technical data, and market participants began exploring alternative data to enhance their trading strategies. Alternative data has been used in the LFT and multi-factor trading space for many years. Here, alternative data nicely dovetails with fundamental data to enhance insights. But there is a growing trend of combining alternative and technical data in the MFT stat-arb world. Alternative data isn’t used in HFT. Knowing how many people walked into a Starbucks is ultimately pointless in ultra-fast trading. Incorporating alternative data into statistical arbitrage has significantly diversified the strategies and techniques available to quant PMs and researchers in MFT. By tapping into these new data sources, traders can uncover new signals, develop more robust models, and improve their ability to generate alpha. The use of alternative data has led to the creation of entirely new trading strategies and enhanced existing ones, allowing for the identification of more subtle and complex relationships between financial instruments and providing additional risk management opportunities. However, acquiring alternative data doesn't automatically generate an overflow of returns. As per Bloomberg, some data sets may be less immediately valuable. As Chris Longworth, a senior scientist at GAM Systematic in the U.K, notes, accessing better data is just part of the picture. Equally important is how this data is incorporated into models and how the resulting uncertainties are handled. Factors driving this trend While the surge can be attributed to enhanced data availability, advancements in analytics, and the pursuit of superior informational edge, I have an underlying personal theory linking this rise with the evolving dynamics within the teams implementing these strategies. In quantitative trading, senior PMs hold the reins of time-tested stat-arb models. These models, honed over years or even decades, are akin to a Formula One car—carefully engineered and relentlessly fine-tuned for maximum performance. However, in this race for alpha, PMs are wary about sharing the blueprints of their "Formula One cars." They understandably guard the intellectual property of their strategies, quite rightly limiting junior researchers' access to avoid the risk of their proprietary methods being taken and replicated or used competitively elsewhere. While the senior PMs are engrossed in enhancing their high-performance trading models, an interesting shift is observed among junior researchers. They are progressively focusing on novel datasets, especially in cash equities, spurred by the seniors' justified protective stance over traditional stat-arb strategies. Keen to deliver alpha, junior researchers relish the opportunity to dive into the untapped potential of alternative data. Leveraging the latest modelling techniques and machine learning methodologies, they can extract valuable insights, creating a refreshing alternative to routine tasks like portfolio construction, data cleaning, risk analysis, or execution-type research work. The trend is driven, in part, by the simple fact a senior PM doesn't want their junior too close to their original strategy, so rather than give them historical price data sets, they give them alternative data sets to see if any value can be found. Another factor is that, initially, low-frequency and multi-factor strategies found the most utility in alternative data, allowing for more accurate long-term forecasts. For example, satellite data determining the frequency of cars in Walmart parking lots could predict earnings and guide investment strategy. Traditional stat-arb focused on exploiting short-term market price inefficiencies in technical data and previously considered such alternative data irrelevant. However, this viewpoint is evolving. In recent years, stat-arb is increasingly blending with alternative data. It's like tweaking the Formula One car's engine to run on a novel fuel mixture, aiming for short-term price inefficiencies while keeping an eye on potential long-term shifts. This trend emerged particularly during the low-volatility environment of 2016-2018, where groups sought to incorporate alternative data to supplement their work. The real breakthrough lies in machine learning's application in statistical arbitrage, particularly deep learning. These algorithms, capable of discerning complex patterns and relationships in data, facilitate the identification of temporary mispricings and market inefficiencies more efficiently. As deep learning algorithms' insatiable appetite for data grows, we can expect even greater use of alternative data, pushing the boundaries of quantitative trading towards exciting, uncharted territories. Conclusion "It is not the smartest or strongest that survive, but the ones most adaptable to change that survive." The landscape of statistical arbitrage has evolved dramatically over the past decade, with the increased incorporation of alternative data sources and the shortening of holding periods as two key trends shaping the field in different ways. These changes have brought challenges and opportunities for quant PMs and researchers, requiring them to adapt and innovate to stay competitive. Integrating alternative data into stat-arb strategies has expanded the range of techniques available to market participants, allowing them to uncover new trading opportunities and improve the overall effectiveness of their models. However, the shift towards shorter holding periods has also emphasised the need for speed, real-time decision-making, excellent execution and advanced risk management. Statistical arbitrage, in its broadest sense, is here to stay. Traders will continuously seek statistical patterns that offer trading opportunities. However, the conventional understanding of stat-arb as a stand-alone strategy must be updated. Maintaining a competitive edge in today's market necessitates incorporating additional data, employing new techniques, shorter holding periods, and leveraging advancements in machine learning and other technologies. As the world of statistical arbitrage continues to change, it is crucial for quant PMs and researchers to remain agile and embrace the opportunities presented by this evolving landscape. By staying at the forefront of technology and continually refining their skills, they can harness the full potential of alternative data, develop cutting-edge strategies, and ultimately succeed in this highly competitive and dynamic industry. Strategies using alternative data might now be more aptly described as 'data arbitrage'. What are you seeing? — We're eager to hear your perspectives on the trends highlighted here and their influence on your quant trading and statistical arbitrage experiences. Any other trends or challenges in your view? Share your thoughts below.  If you found this discussion valuable and want to stay informed about the latest developments in quant trading, statistical arbitrage, and the broader finance industry, be sure to follow our page. We will continue to share insights, updates, and thought-provoking content that aims to inform, educate, and inspire. We also encourage you to engage with our future posts by sharing your thoughts, questions, and experiences as we strive to stay ahead in this ever-evolving field. Further Reading: https://www.fintechnews.org/how-hedge-funds-use-alternative-data-to-make-investments/ https://www.globenewswire.com/news-release/2023/04/19/2649810/0/en/Alternative-Data-Global-Market-Report-2023-Rising-Demand-From-Hedge-Funds-Bolsters-Sector.html https://consent.yahoo.com/v2/collectConsent?sessionId=3_cc-session_9f07d6ae-5eae-4a42-9312-67512ca13ce1 https://www.cityam.com/after-ai-update-bloomberg-looks-to-boost-terminals-with-more-alternative-data/ https://www.bloomberg.com/news/articles/2022-12-16/quant-traders-are-big-winners-in-this-year-s-market-turmoil?leadSource=uverify%20wall

Please free free to share

Share by: