From Human Mapping to Machine Embedding: Uncovering Key Legal Drivers and Deterrents of ISDS Filing Frequencies
International investment agreements (IIAs), while intended to prop cross-border investment, have faced persistent criticism for potentially undermining the regulatory sovereignty of developing countries. Various mechanisms have been proposed as alternatives to traditional bilateral investment treaty (BIT) models, often with the goal of curbing investor-state dispute settlement (ISDS) filings. While existing research has uncovered the impact of nonlegal factors, such as macroeconomic crises, little has been done to systematically examine how legal provisions in either major model BITs or ISDS reform toolboxes influence ISDS filing patterns. To address this gap, this Article analyzes the interplay between (i) legal texts of 2,148 BITs and treaties with investment provisions (TIPs) and (ii) the occurrence of 1,060 ISDS cases. It builds on the United Nations Conference on Trade and Development (UNCTAD)’s IIA Mapping Project to assess the impact of key legal deterrents recommended by ISDS reform proponents, while leveraging large language models to identify the key legal drivers of ISDS filings. The outcome of Poisson regression appears to reveal that: (i) procedural provisions resembling those in the 2012 U.S. Model BIT are the strongest positive predictors of ISDS filings, outweighing the impact of economic crises, whereas substantive provisions such as investor treatment and expropriation clauses are not; (ii) the effectiveness of deterrent provisions remains inconclusive, suggesting that their ability to curb ISDS filings requires further scrutiny; and (iii) the assumption that IIAs between developed host countries and developing states are more prone to ISDS filings is unsubstantiated. These findings could contribute to ongoing discussions on BIT reform by highlighting the legal determinants of ISDS frequencies, with implications for policymakers seeking to balance investment protection with regulatory autonomy.
I. Introduction
International investment agreements (IIAs) have remained a cornerstone of international investment protection, proliferating with at least 2,844 bilateral investment treaties (BITs) and 480 treaties with investment provisions (TIPs) as of March 1, 2025.1 However, IIAs and their Investor-State Dispute Settlement (ISDS) mechanisms have faced persistent criticism for potentially undermining governments’ regulatory sovereignty, particularly during economic or social crises, while lacking sufficient transparency and predictability. In response, the U.N. Commission on International Trade Law (UNCITRAL)’s Working Group III has been drafting proposals for ISDS reform,2 while reputable institutions—including Chatham House, the Columbia Center on Sustainable Investment, and the Center for the Advancement of the Rule of Law in the Americas at Georgetown—have published reports and proposed toolkits addressing ISDS reform.3
While these efforts explore systemic changes, their specific recommendations frequently focus on curbing ISDS filings both substantially and procedurally. 4 One reason is that a high volume of ISDS filings not only signals deeper systemic issues in the drafting or interpretation of IIAs but also contributes to regulatory chill and places significant resource burdens on states—particularly developing countries.5 Despite this focus on the frequency of ISDS filings, research remains sparse on whether and how legal provisions in major model BITs or ISDS reform toolkits influence ISDS filing patterns.6 Notably, a study of the frequency of ISDS filings focuses on macroeconomic factors; Bellak and Leibrecht’s 2021 paper—whose insight this paper draws upon—identifies economic crises and the number of previous ISDS filings by investors in a specific host country as key determinants of ISDS filing frequencies.7 Berge’s 2020 paper, meanwhile, finds that substantive obligations—including national treatment (NT), most-favored-nation treatment (MFN), fair and equitable treatment (FET), and compensation for expropriation—are the only significant legal predictor of ISDS occurrences, while greater flexibility or precision in treaty language does not significantly affect ISDS risks.8 Rather than assess the impact of specific substantive or procedural provisions, this study categorizes mapped IIA provisions into broader attributes and arrives at a conclusion that contrasts with Berge’s 2020 finding.
In this context, this study analyzes the interplay between (i) the legal texts of 2,148 BITs and treaties with investment provisions that came into effect before 2019 and (ii) 1,060 ISDS cases filed before 2020, taking into account the availability of economic crisis data, one of the most critical control variables, up to 2019. A key challenge in this analysis is how to quantify treaty provisions. Significant efforts have been made to manually map IIA provisions, as exemplified by the U.N. Conference on Trade and Development (UNCTAD)’s IIA Mapping Project.9 This dataset has been instrumental in prior studies, including Berge’s 2020 paper mentioned above and Thompson et al.’s 2019 study, which explores how actual ISDS cases influence calls for IIA modification or termination.10
While this Article also leverages UNCTAD’s mapping data, it introduces novel analytical tools, incorporating the embedding capabilities of large language models (LLMs) to gain deeper insights into the legal drivers of ISDS filings. There are previous studies that analyzed the texts of IIAs. Notably, Baccini et al.’s 2015 study used cluster analysis to classify IIAs into three categories: the EU model, the NAFTA model, and the Southern model.11 Allee et al.’s 2014 study found that stronger enforcement provisions in BITs correlate more with the bargaining power of the capital-exporting treaty partner than with the initiative of investment-seeking states.12 Unlike the previous studies, which relied on manual coding, this study leverages the analytical capabilities of LLMs to engineer features that capture the substantive and procedural dimensions of IIA provisions. Poisson regression analysis, encompassing the newly engineered features, indicates that: (i) procedural provisions resembling those in the 2012 U.S. Model BIT are the most significant positive predictors of ISDS filings, surpassing the influence of economic crises and other variables, while substantive provisions do not show clear impact, contrary to Berge’s findings; (ii) the effectiveness of deterrent provisions remains uncertain, suggesting that their role in reducing ISDS filings requires further investigation; and (iii) the widely held belief that IIAs between developed host countries and developing states are more susceptible to ISDS filings is not demonstrated.
II. Methodology
This Article seeks to identify the legal drivers and deterrents influencing the frequency of ISDS filings. To achieve this, it employs a multi-step approach: (i) extracting treatment, expropriation, and procedural provisions from IIAs; (ii) transforming these provisions into embeddings using LLMs and quantifying their similarity to the U.S. Model BIT 2012; and (iii) incorporating the resulting similarity scores—alongside other variables such as macroeconomic indicators and ISDS deterrent provisions—into a Poisson regression model designed to explain the occurrence of ISDS filings. To ensure replicability, a code script and dataset are publicly available at https://github.com/replicable/isds.
A. Data Collection
As noted, the UNCTAD Investment Policy Hub provides access to the IIA Navigator, which includes 2,844 BITs and 480 TIPs, totaling 3,194 agreements. From this dataset, I extracted 2,148 effective treaties after applying the following exclusions: (i) 108 treaties that are not yet in force (terminated treaties, however, are included); (ii) 17 overlapping samples, primarily related to the dissolution of the Soviet Union; (iii) 151 IIAs that came into effect after 2019, as I failed to obtain consistent economic crisis data after this period; (iv) 749 IIAs lacking UNCTAD’s mapping data (however, I manually mapped 11 IIAs that had triggered five or more ISDS cases but were omitted in the UNCTAD’s database, including the North American Free Trade Agreement (NAFTA) and the Energy Charter Treaty (ECT), and included them in the dataset); and (v) 146 IIAs without ISDS provisions, including 12 treaties incorrectly coded by UNCTAD as containing ISDS provisions and 10 treaties for which the presence of ISDS provisions cannot be definitively verified due to the unavailability of an English version. Additionally, the UNCTAD Investment Policy Hub offers the Investment Dispute Settlement Navigator that provides information on 1,332 known treaty-based ISDS cases,13 from which I extracted 1,060 cases which were filed before 2020.
For these samples, UNCTAD mapping data is available. However, to conduct a more detailed analysis of substantive provisions—including NT, MFN, FET, and expropriation as well as procedural ISDS provisions, I developed and implemented an automated method to identify relevant treaty articles. While this approach involved a keyword search and cosine similarity of embeddings, initially focusing on article headings, and when unavailable, analyzing paragraph content,14 I manually reviewed and corrected misclassifications to sort out irrelevant articles. While the UNCTAD Investment Policy Hub provides scanned treaty versions, these often trigger high levels of optical character recognition errors. To overcome this issue, my code matched and extracted text file versions from the Electronic Database of Investment Treaties (EDIT), managed by the World Trade Institute.15
B. Preprocessing
From each IIA, three key provisions are extracted: (i) treatment of investors, consolidated in the order of FET, NT, and MFN16 ; (ii) expropriation; (iii) procedures for submission of an ISDS claim to arbitration, including consent to arbitration and conditions and limitations on consent. Other substantive obligations, such as free transfer and prohibition of performance requirements, were excluded due to their relative homogeneity and lesser significance in ISDS disputes. Additionally, while procedural provisions related to the constitution of arbitral panels and the conduct of arbitration may also be significant, they were excluded from this analysis due to (i) their absence in many IIAs and (ii) the technical challenges of reliably and consistently isolating them from single-article, all-inclusive ISDS provisions within certain IIAs found in many IIAs. To minimize potential bias from this omission, the analysis focuses on comparing relevant procedural content with Article 24 (Submission of a Claim to Arbitration), Article 25 (Consent of Each Party to Arbitration), and Article 26 (Conditions and Limitations on Consent of Each Party) of the 2012 U.S. Model BIT, while excluding other procedural articles such as Article 27 (Selection of Arbitrators), Article 28 (Conduct of the Arbitration), and Article 29 (Transparency of Arbitral Proceedings).
Each IIA’s three provisions were transformed into contextual embeddings for numerical analysis of their semantic meaning. This process employs OpenAI’s GPT-3.5 vector embedding model, text-embedding-3-small, which produces 1,536-dimensional embeddings from textual inputs.17 My code then calculates the cosine similarity of each of the three provisions to those in the 2012 U.S. Model BIT, which might stand for the sophistication of the provision or customary international practices.18 Cosine similarity between two vectors in an embedding space reflects how often the corresponding tokens co-occur, indicating their semantic closeness. For sentence embeddings, the cosine similarity is computed as the weighted average of the similarities between the embeddings of the tokens that make up the sentence. This score is highly effective in capturing the semantic similarity between two provisions.
An example of this methodology—comparing NAFTA with the 2012 U.S. Model BIT—is presented in Table 1.
NAFTA | 2012 U.S. Model BIT | Similarity | |
---|---|---|---|
Treatment |
Article 1105 (Minimum Standard of Treatment) (omitted) Article 1102 (National Treatment) (omitted) Article 1103 (Most-Favored-Nation Treatment) (omitted) |
Article 5 (Minimum Standard of Treatment) (omitted) Article 3 (National Treatment) (omitted) Article 4 (Most-Favored-Nation Treatment) (omitted) |
0.940 |
Expropriation | Article 1110 (Expropriation and Compensation) (omitted) | Article 6 (Expropriation and Compensation) (omitted) | 0.900 |
Procedure |
Article 1116 (Claim by an Investor of a Party on Its Own Behalf) (omitted) Article 1120 (Submission of a Claim to Arbitration) (omitted) Article 1122 (Consent to Arbitration) (omitted) |
Article 24 (Submission of a Claim to Arbitration) (omitted) Article 25 (Consent of Each Party to Arbitration) (omitted) Article 26 (Conditions and Limitations on Consent of Each Party) (omitted) |
0.875 |
Figure 1 illustrates how the embedding-based similarity scores work for IIAs that the U.S. has entered into, when the similarity scores for treatment, expropriation, and procedure are used as the x, y, and z coordinates, respectively. The U.S.–Uruguay BIT (2005) and the Rwanda–U.S. BIT (2008) exhibit particularly high levels of similarity (98.7% and 98.5% on average), whereas the U.S.–Sri Lanka BIT (1991) shows a lower level of similarity (78.5% on average). Notably, the United States-Mexico-Canada Agreement (USMCA) (2020) (based on the U.S.-Mexico terms) and NAFTA (1992) exhibit average similarity scores of 94.93% and 90.50%, respectively.

The code puts all the 3,194 IIAs and applies k-means clustering to provide insight into the convergence and divergence of IIAs in the global investment landscape. Figure 2 presents the results of k-means clustering, showing that IIAs are effectively clustered into three groups,19 along three axes: similarity to the 2012 U.S. Model BIT in terms of treatment, expropriation, and procedure. Group 1 consists of modern FTAs that are heavily influenced by NAFTA, whereas Group 3 includes IIAs that diverge significantly from NAFTA in terms of investor treatment and Group 2 is in the mezzanine. The samples most adjacent to centroids of Group 1, 2, 3 are Colombia – U.K. BIT (2010), China – U.K. BIT (1986), and France – Panama BIT (1982), respectively.

C. Model
A key hypothesis to test is that the frequency of ISDS filings in a given calendar year follows a Poisson distribution,20 influenced by several categories of variables: 𝐶𝑟𝑖𝑠𝑖𝑠t, 𝑃𝑎𝑟𝑡𝑦ij, 𝑇𝑒𝑟𝑚𝑖𝑛𝑎𝑡𝑒at, 𝑈𝑆𝐵𝐼𝑇𝑆𝑖𝑚a, and 𝐷𝑒𝑡𝑒𝑟a.
log(E(Filingat|Xati))~ Xati
Xati=εat+θ1TCrisisti+θ2TPartyti+ θ3TTerminateat+θ4TUSBITSima(+θ5TDetera)
where α represents a specific IIA (out of 2,148 in total), t denotes a specific year between 1993 to 2019 (following the IIA’s effective date), i denotes a particular IIA, and Eat is a residual for the IIA in a given year. The dataset comprises 39,524 treaty-dyadic yearly samples, which comprise the following variables:
- Filingat: Dependent variable
This represents the number of ISDS filings for a specific IIA in a given year. Figure 3 presents the top 10 IIAs with the highest cumulative number of ISDS filings during 1993–2024.

- Crisisti: Economic crises
As noted, the selection of this variable is inspired by Bellak and Leibrecht (2021).21 However, unlike their study, which uses country-dyadic data, this analysis is conducted at the treaty level, as this Article matches ISDS cases with relevant IIAs, rather than with a specific country. For multilateral IIAs, Crisisti is coded as 1 if at least one of the participating parties is experiencing an economic crisis.
To capture economic crises, I utilized three dummies indicating banking, currency, and debt crisis coded by Nguyen et al. (2022), covering the period of 1993 to 2019.22 Due to the lack of crisis data beyond 2019, IIAs that came into effect after that year were excluded from the analysis.
- Partyti: Other treaty party characteristics
This vector consists of two variables: (i) whether at least one party was a developed country and at least one party was a developing country in a given year, based on the International Monetary Fund’s classification of advanced economies and emerging and developing economies23 ; and (ii) the total number of ISDS claims filed by investors within the host country prior to the given year, which Bellak and Leibrecht (2021) also identified as one of the key factors affecting ISDS filing frequency.24
- Terminateat: Years after termination of the treaty
Apart from IIAs that were replaced by another agreement, already terminated IIAs each year are assigned a value of 1 for this variable, rather than being entirely excluded from the sample.
- USBITSima: Similarity to the 2012 U.S. Model BIT
This vector comprises three scores representing similarity to the 2012 U.S. Model BIT, in terms of treatment, expropriation, and the initiation of ISDS claims.
- Detera: Deterrent provisions
This vector consists of dummy variables representing potentially deterrent provisions as coded by UNCTAD’s IIA Mapping Project, including: (i) IIA scope exclusions (taxation, subsidies, procurement, others, and applicable only to post-BIT events), (ii) MFN exceptions (economic integration, taxation treaties, procedural issues, regulatory), (iii) expropriation exceptions (regulatory and compulsory licenses), (iv) social protections (health and environment, labor standards, right to regulate, corporate social responsibilities, and not lowering of standards), (vi) exception self-judging, (vii) ISDS scope exclusions (scope of claims, ISDS limitation, policy exclusion, taxation or prudential, limited remedies, and limiting amicus curiae), and (viii) ISDS filing restrictions (case-by-case consent, fork in the road, no U turn, limitation period, local remedies first, and voluntary ADR). While UNCTAD provides a detailed codebook for each variable,25 Appendix I provides its summary.
III. Outcomes
A. Outcome of Poisson Regression
The results of the Poisson regression predicting the counts of ISDS filings are presented in Table 2.
Model Without Detera | Model With Detera | |
---|---|---|
(Intercept) | -9.3946547*** (< 2e-16) | -10.617567*** (< 2e-16) |
Crisis: Banking Crisis: Currency Crisis: Debt |
0.3477968*** (1.59e-05) 0.1575510 (0.204553) -0.0315204 (0.702595) |
0.435705*** (1.96e-07) 0.243165. (0.052653) 0.073466 (0.397282) |
Party: Develop Party: Prior Claims |
-0.2422656*** (0.000140) 0.0039947*** (1.87e-05) |
0.001301 (0.986160) 0.007070*** (1.58e-13) |
Terminate | -0.0915554 (0.272499) | -0.103608 (0.224036) |
USBITSim: Treatment USBITSim: Expropriation USBITSim: Procedure |
-2.5288473*** (0.000794) -2.2438151*** (2.94e-08) 13.7917405*** (< 2e-16) |
1.622045 (0.102204) -2.757705*** (1.53e-05) 9.964697*** (< 2e-16) |
IIA: Taxation IIA: Subsidies IIA: Procurement IIA: Others IIA: Post-BIT MFN: economic integration MFN: taxation MFN: procedural issues Expropriation: Regulatory Expropriation: Compulsory licenses Social: Health and environment Social: Labor standards Social: Right to regulate Social: Corporate responsibility Social: Not lowering standards Security: Exception self-judging ISDS Scope: Treaty Claims ISDS Scope: Limitation ISDS Scope: Policy ISDS Scope: Taxation or prudential ISDS Scope: Limited remedies ISDS Scope: Allowing amicus curiae ISDS Filing: Case-by-case consent ISDS Filing: Fork in the road ISDS Filing: Limitation period ISDS Filing: Local remedies first ISDS Filing: Voluntary ADR |
-0.223596* (0.042177) 2.154914*** (< 2e-16) -2.159584*** (2.79e-15) 0.311248* (0.013689) 0.548526*** (3.89e-06) -0.843211*** (2.29e-11) 0.227083* (0.036309) 0.754468*** (3.14e-07) -1.914092*** (8.68e-10) 0.941618*** (2.05e-05) 0.314018* (0.018089) 0.124098 (0.550898) 1.224714*** (< 2e-16) -1.271816* (0.010943) -0.295417 (0.201128) -0.257900 (0.195030) -0.103657 (0.207556) 0.358940*** (0.000573) -0.044751 (0.800290) -0.199341 (0.289735) 0.731173*** (0.000930) 0.928169* (0.017298) 0.106267 (0.472885) -0.927350** (0.002669) -1.071619*** (2.19e-06) -0.206317 (0.237112) 0.244765** (0.001312) |
|
N | 39,524 | 39,524 |
AIC | 9805.6 | 8616.7 |
Dispersion | 1.823704 | 1.344172 |
Max VIF | 1.3808 (USBITSim: Procedure) | 8.394632 (IIA: Procurement) |
Overall, procedural provisions, prior claims, and banking crise appear to be the strongest predictors of ISDS filings.
The similarity of procedural provisions to the 2012 U.S. Model BIT emerges as a major driver of ISDS filings, reinforcing the hypothesis that procedural elements in treaties play a crucial role in dispute occurrences. The coefficients of 13.792 and 9.965 for the procedural similarity score are notably high, translating to a 976,764- and 21,269-fold increase (e13.792 and e9.965) in ISDS filing frequencies, respectively. The primary reason may be that the variable’s variance is significantly lower than that of other 0-or-1 dummy variables, as shown in Figure 2. Table 2 indicates that both models exhibit a decent degree of dispersion (1.8237 and 1.3442, respectively), suggesting that variance does not deviate from mean.
In contrast, expropriation provisions (and treatment provisions in the case of the model without Detera) exhibit a surprisingly negative correlation with ISDS filings, challenging prior literature that highlights substantive obligation standards as key determinants.
Among party-specific factors, prior ISDS claims serve as a predictor, indicating that countries with a history of ISDS cases are more likely to face future claims, though its coefficient is not particularly strong. Meanwhile, developed-developing party status has a surprisingly negative impact in the model without Detera and no significant impact in the model with Detera, undermining the assumption that IIAs between developed host countries and developing states are inherently more prone to ISDS filings.
The effectiveness of deterrent provisions in reducing ISDS claims remains inconclusive. While some provisions—such as procurement exceptions (-2.16), regulatory carve-outs for expropriation (-1.91), corporate social responsibility (CSR) (-1.27), filing limitation periods (-1.07), fork-in-the-road clauses (-0.93), and economic integration exceptions for MFN (-0.84)—are associated with lower ISDS risk, the broader impact of deterrent mechanisms is not clearly substantiated. In particular, subsidy/grant exceptions (+2.15), right to regulate (+1.22), exclusion of compulsory licenses from expropriation (+0.94), exclusion of ISDS provisions found in other treaties (+0.75), limited remedies (+0.73), post-BIT provisions (+0.55), etc. unexpectedly correlate with more ISDS filings, while they are designed to restrict ISDS filings or awards.
B. Robustness Checks
As noted, the coefficients of 13.792 and 9.965 for the procedural similarity score are notably high. However, as both models show a reasonable level of dispersion (1.8237 and 1.3442, respectively), these unusually high coefficients would not signal overdispersion—where the variance exceeds the mean— in the models. As such, it is not necessary to implement negative binomial distribution models.
A key robustness challenge lies in the overwhelming number of zero counts in ISDS filings: out of 39,524 treaty-dyadic yearly observations, 38,763 (98.1%) record no ISDS filings, while only 632, 73, and 28 observations record one, two, and three filings, respectively. This justifies the use of a zero-inflated Poisson (ZIP) regression model, which first uses a logistic regression to model the probability of excess zeros and then applies a Poisson regression to model count outcomes for observations not predicted to be structural zeros.27 Table 3 shows its outcome.
Model Without Detera | Model With Detera | |
---|---|---|
(Intercept) | -1.036583 (0.13445) | -10.619204*** (8.46e-05) |
Crisis: Banking Crisis: Currency Crisis: Debt |
0.038923 (0.78041) -0.056606 (0.82031) -0.162466 (0.28447) |
0.380994* (0.010475) 0.651357** (0.002915) 0.806135*** (4.93e-05) |
Party: Develop Party: Prior Claims |
-0.429502** (0.00428) -0.001853 (0.34385) |
-0.645591*** (0.000975) -0.007406*** (0.000480) |
Terminate | -0.305293 (0.16325) | -0.059928 (0.705238) |
USBITSim: Treatment USBITSim: Expropriation USBITSim: Procedure |
-2.980004* (0.02992) -7.840975*** (8.35e-09) 13.294663*** (< 2e-16) |
7.702902** (0.002470) 0.799259 (0.638882) 3.246007. (0.063159) |
IIA: Taxation IIA: Subsidies IIA: Procurement IIA: Others IIA: Post-BIT MFN: economic integration MFN: taxation MFN: procedural issues Expropriation: Regulatory Expropriation: Compulsory licenses Social: Health and environment Social: Labor standards Social: Right to regulate Social: Corporate responsibility Social: Not lowering standards Security: Exception self-judging ISDS Scope: Treaty Claims ISDS Scope: Limitation ISDS Scope: Policy ISDS Scope: Taxation or prudential ISDS Scope: Limited remedies ISDS Scope: Allowing amicus curiae ISDS Filing: Case-by-case consent ISDS Filing: Fork in the road ISDS Filing: Limitation period ISDS Filing: Local remedies first ISDS Filing: Voluntary ADR |
-2.106189*** (1.80e-14) 1.849253*** (9.61e-06) -3.281795*** (9.64e-12) 1.119760*** (0.000100) 0.194069 (0.508992) -0.947298*** (0.000543) 0.162344 (0.491965) 2.286047*** (3.59e-08) -0.952592* (0.047612) 2.727114*** (3.77e-09) -1.252208*** (0.000101) 0.143434 (0.768106) 1.347133*** (1.67e-08) -0.244752 (0.771486) -0.190675 (0.657703) 0.417826 (0.381988) 0.240664 (0.145280) 0.872235*** (7.94e-05) -0.214258 (0.399828) 0.172492 (0.618988) 0.287729 (0.359550) 0.665365 (0.440267) -0.02385 (0.946945) -2.256180*** (0.000322) -1.772655*** (2.78e-06) 0.402943 (0.258133) -0.035774 (0.848018) |
|
N | 39,524 | 39,524 |
The results of the model with Detera are largely consistent with those of the Poisson regression, with the exception that the macroeconomic crisis variables and prior claims lose their explanatory power. In the model with Detera, though the statistical significance of similarity in procedural provisions is reduced, it still shows a positive correlation with higher ISDS filing counts.
C. Implications
This reinforces previous findings that ISDS filing occurrences are closely linked to banking crises, as evidenced by the exceptionally high number of filings faced by Argentina and Venezuela during the Chávez-Maduro era. Setting aside the political turmoil that accompanied these crises and contributed to increased filings, this trend may prompt states to reassess the impact of IIAs on their ability to navigate financial crises effectively.
While aligning substantial obligations with global standards—such as the 2012 U.S. Model BIT—appears to be either neutral or even ISDS-deterrent, potentially due to enhanced clarity and predictability, the alignment of procedural provisions is strongly and significantly correlated with ISDS occurrences. States interested in ISDS reform may find it more effective to focus on procedural provisions.
The impact of deterrent provisions on reducing ISDS filings remains inconclusive. Given that ISDS reform discourses are still relatively new and these provisions lack a long historical track record, further research is needed before drawing definitive conclusions.
D. Limitations
This study has several limitations. First, the method used to extract treatment, expropriation, and procedural provisions from IIAs may influence the calculated similarity scores and, consequently, the outcomes of the regression models. Second, the Detera variables rely heavily on human-coded data from the UNCTAD IIA Mapping Project, meaning any inaccuracies in the coding could affect the reliability of the regression results. Lastly, the study does not account for potential reverse causality: countries facing higher ISDS risk might be more likely to adopt U.S.-style treaties or incorporate stronger deterrent provisions.
IV. Conclusion
Rather than simply embracing the hype surrounding AI, this Article critically examines its practical role in augmenting traditional empirical research, demonstrating how machine learning and text embeddings can enhance quantitative legal analysis. By leveraging AI-driven methodologies, this approach enables more precise data clustering, offering deeper insights into patterns of convergence and divergence across jurisdictions. Additionally, AI-assisted text analysis facilitates the creation of novel independent variables, allowing for more nuanced and comprehensive empirical assessments of ISDS dynamics.
The findings reinforce the importance of procedural treaty design in driving ISDS filings while challenging conventional assumptions about the role of substantive treatment provisions and developed-developing country dynamics. Furthermore, AI-driven legal analytics can empower policymakers by providing empirical clarity on the impact of deterrent provisions, helping refine IIA harmonization efforts at the global level and guiding more informed national strategies for negotiating IIAs and managing ISDS risks. This Article underscores AI’s potential not as a replacement for legal expertise, but as a powerful tool to complement traditional methods, offering policymakers a data-driven foundation for evidence-based treaty design and dispute resolution strategies.
[Appendix I]
Codebook
Variables | Type | Coding Criteria28 |
---|---|---|
Crisis: Banking Crisis: Currency Crisis: Debt |
Binary Binary Binary |
|
Party: Develop Party: Prior Claims |
Binary Continuous |
|
Terminate | Binary |
|
USBITSim: Treatment USBITSim: Expropriation USBITSim: Procedure |
Continuous Continuous Continuous |
|
IIA: Taxation IIA: Subsidies IIA: Procurement IIA: Others IIA: Post-BIT |
Binary Binary Binary Binary Binary |
|
MFN: economic integration MFN: taxation MFN: procedural issues |
Binary Binary Binary |
|
Expropriation: Regulatory Expropriation: Compulsory licenses |
Binary Binary |
|
Social: Health and environment Social: Labor standards Social: Right to regulate Social: Corporate responsibility Social: Not lowering standards |
Binary Binary Binary Binary Binary |
|
Security: Exception self-judging | Binary |
|
ISDS Scope: Treaty Claims ISDS Scope: Limitation ISDS Scope: Policy ISDS Scope: Taxation or prudential ISDS Scope: Limited remedies ISDS Scope: Allowing amicus curiae |
Binary Binary Binary Binary Binary Binary |
|
ISDS Filing: Case-by-case consent ISDS Filing: Fork in the road ISDS Filing: Limitation period ISDS Filing: Local remedies first ISDS Filing: Voluntary ADR |
Binary Binary Binary Binary Binary |
|
- 1Investment Policy Hub, International Investment Agreements Navigator, UNCTAD https://perma.cc/R9WD-HXNP (last visited Mar. 1, 2025).
- 2UNCITRAL, Possible Reform of Investor-State Dispute Settlement (ISDS): Draft multilateral Instrument on ISDS Reform, U.N. Doc A/CN.9/WG.III/WP.246 (2025), https://perma.cc/2P9A-NNV7.
- 3Lauge N. Skovgaard Poulsen & Geoffrey Gertz, Reforming the Investment Treaty Regime: A ‘Backward-Looking’ Approach (2021), https://perma.cc/2746-S7QV;. Nick Robins, The Emergence of Sustainable Investing, in Sustainable Investing: The Art of Long Term Performance 3–18 (Cary Kronisky & Nick Robins eds., 2008). Nazly D. Gomez & Daniel R. Jurado, New Directions in International Investment Law: Alternatives for Improvement (2021), https://perma.cc/7U7C-CZMZ.
- 4Gomez & Jurado, supra note 3, at 14–19 (proposing, as means of ISDS reform, procedural solutions such as (i) exhaustion of local remedies, (ii) fork in the road, (iii) counterclaims, (iv) keyholes, (v) regulation of third-party funding, and (vi) dismissal of frivolous claims, as well as substantive solutions such as (i) reduction of the reach of obligations and (ii) elimination of obligations). See also Skovgaard & Gertz, supra note 3, at 1–8 (criticizing policymakers’ emphasis on constraining or replacing ISDS only in future IIAs and proposing for alternative approaches, such as a plurilateral mechanism for “interpretative statements,” in which governments jointly clarify and define their positions on contentious clauses in their existing IIAs).
- 5For instance, (i) in 2019, Pakistan was required to pay USD 6 billion in compensation to an Australian mining company—an amount equivalent to the IMF bailout it received that year. See Tethyan Copper Company Pty Limited v. Pakistan, ICSID Case No. ARB/12/1, Award, 12 July 2019; Recited from Skovgaard & Gertz, supra note 3, at 2; (ii) as of March 2025, Honduras is reportedly facing ISDS claims totaling USD 18 billion—some linked to energy transition—an amount that surpasses the country’s entire annual budget. See Phoebe Western & Patrick Greenfield, Why Fear of Billion-Dollar Lawsuits Stops Countries Phasing Out Fossil Fuels, The Guardian (Mar. 6, 2025), https://perma.cc/RGE9-5JW3.
- 6The empirical literature on ISDS has instead focused on (i) the impact of ISDS provisions on the amount of foreign directive investment. See Eric Neumayer & Laura Spess, Do Bilateral Investment Treaties Increase Foreign Direct Investment to Developing Countries?, 33 World Dev. 1567 passim (2005); Peter Egger & Michael Pfaffermayr, The Impact of Bilateral Investment Treaties on Foreign Direct Investment, 32 J. Comp. Econ. 788 passim (2004); Andrew Kerner, Why Should I Believe You? The Costs and Consequences of Bilateral Investment Treaties, 53 Int. Stud. Q. 73 passim (2009); (ii) the impact of ISDS filings on IIA practices see Lauge N.S. Poulsen & Emma Aisbett, When the Claim Hits. Bilateral Investment Treaties and Bounded Rational Learning, 65 World Politics 273 passim (2013); and (iii) determinants of ISDS outcomes see Daniel Behn, Tarald L. Berge & Malcolm Langford, Poor States or Poor Governance? Explaining Outcomes in Investment Treaty Arbitration, 38 Nw. J. Int’l L & Bus. 333 passim (2018); Julian Donaubauer, Eric Neumayer & Peter Nunnenkamp, Winning or Losing in Investor-to-State Dispute Resolution: The Role of Arbitrator Bias and Experience, 26(4) Rev. Int’l Econ. 892 passim (2018).
- 7Christian Bellak & Markus Leibrecht, Do Economic Crises Trigger Treaty–Based Investor–State Arbitration Disputes?, 24 J Int’l Econ L 127 passim (2021).
- 8Tarald L. Berge, Dispute by Design? Legalization, Backlash, and the Drafting of Investment Agreements, 64 Int’l Stud. Quarterly 919 passim (2020).
- 9UNCTAD, supra note 1.
- 10Alexander Thompson, Tomer Broude & Yoram Z. Haftel, Once Bitten, Twice Shy? Investment Disputes, State Sovereignty, and Change in Treaty Design, 73 Int’l Org. 859 passim (2019).
- 11Leonardo Baccini, Andreas Dür & Yoram Haftel, Imitation and Innovation in International Governance: The Diffusion of Trade Agreement Design, in Trade Cooperation: The Purpose, Design and Effects of Preferential Trade Agreements (Andreas Dür & Manfred Elsig eds., 2015).
- 12Todd Allee & Clint Peinhardt, Evaluating Three Explanations for the Design of Bilateral Investment Treaties, 66 World Politics 47 passim (2014).
- 13UNCTAD, supra note 1.
- 14Cosine similarity (also known as the normalized dot product) between two vectors in an embedding space measures their semantic closeness, typically yielding higher values when the vectors have large components in the same dimensions, which indicates that they have similar meanings. See Daniel Jurafsky & James H. Martin, Speech and Language Processing 110–11 (3d ed. 2025), https://perma.cc/R27F-ZC6N (last visited Apr. 27, 2027). The cosine similarity between two vectors v and w can be computed as:
cosinev, w=v∙wv|w|=i=1Nviwii=1Nvi2i=1Nwi2
This metric is particularly useful for retrieving specific provisions from lengthy agreements based on their semantic relationship to keywords such as treatment, expropriation, and arbitration.
- 15Wolfgang Alschner, Manfred Elsig & Rodrigo Polanco, Introducing the Electronic Database of Investment Treaties (EDIT): The Genesis of a New Database and Its Use, 20 World Trade Rev. 73, 73–94 (2021).
- 16FET provisions are often located in articles titled “Promotion and Protection of Investors” or similar. Such articles were included in the analysis, except when they exclusively address the facilitation and admission of investment without explicitly providing fair, equitable, or non-discriminatory treatment of investors.
- 17An embedding is a vector of floating-point numbers. Vectors representing tokens—often morphemes that make up words—are placed (“embedded”) in a vector space such that the distance between two vectors reflects their relatedness: smaller distances indicate higher relatedness, while larger distances indicate lower relatedness. See Vector Embeddings, OpenAI, https://perma.cc/9MCK-9H25 (last accessed Mar. 29, 2025).
- 18See Glamis Gold Ltd. v. United States, UNCITRAL, Award ¶¶ 602–03 (June 8, 2009) (referencing model BITs as one of the “very few authoritative sources” for customary international practices).
- 19The silhouette score, which evaluates how well k-means clustering groups data points is highest (0.3862) when the number of clusters (k) is three. See Peter J. Rousseeuw, Silhouettes: A Graphical Aid to the Interpretation and Validation of Cluster Analysis, 20 Comput. Appl. Math. 53 passim (1987). The scores are 0.2708, 0.2978, 0.2993, 0.2501, 0.2615, and 0.2553 for k =2, 4, 5, 10, 20, 50, respectively.
- 20J.A. Nelder, Log Linear Models for Contingency Tables: A Generalization of Classical Least Squares, 23(3) J. R. Stat. Soc., Series C (Appl. Stat.) 323 passim (1974).
- 21Bellak & Leibrecht, supra note 7, at 138–41.
- 22Thanh C. Nguyen, Vítor Castro & Justine Wood, A New Comprehensive Database of Financial Crises: Identification, Frequency, and Duration, 108 Econ. Model. 1, 12–15 (2022).
- 23Int’l Monetary Fund, World Economic Outlook Database: Groups and Aggregates Information (2023), https://perma.cc/A6EK-NHAD (last accessed Mar. 29, 2025,).
- 24Bellak & Leibrecht, supra note 7, at 143–44 (“This variable captures the awareness of foreign investors from a country of the possibility to bring a case against a particular host country before an arbitration council.”).
- 25UNCTAD, UNCTAD IIA Mapping Project, https://perma.cc/3YE7-EU4Q (last accessed Mar. 29, 2025).
- 26The coefficient of the model is to be interpreted as the increase in the logarithm of the count of ISDS claims when each independent variable increases by 1.
- 27Diane Lambert, Zero-Inflated Poisson Regression, with an Application to Defects in Manufacturing, 34 Technometrics 1 passim (1992).
- 28For variables other than Crisis, Party, Terminate, and USBITSim, see UNCTAD, supra note 25, at 7–19.