Urban Activity Dynamics During Republic Day in Istanbul#
In this case study, we apply our methodology to anonymized mobility data from Veraset to assess the impact of a large, planned public event on urban activity patterns. Understanding how populations visit urban areas during such events provides valuable insights for urban planning, infrastructure management, and emergency preparedness. We focus on Republic Day in Istanbul (29 October 2023), a major national holiday in Turkey marked by large public gatherings, celebrations, and increased activity across urban spaces. Using the Urban Space Usage Index, we identify deviations from typical activity patterns and examine how different areas of the city respond to the event.
1. Data#
1.1 Mobility Dataset#
The analysis is based on the Veraset Movement dataset, provided by Veraset as part of the Mobility Data collection from the Development Data Partnership. The dataset consists of anonymized mobile device location pings collected via a network of mobile applications and software development kits (SDKs). Each record includes geographic coordinates, a timestamp, and an anonymized device identifier. These data provide large-scale observations of human mobility, enabling the analysis of spatial and temporal patterns of urban activity.
1.2 Area of Interest (AOI)#
The analysis focuses on Istanbul, Turkey. The study area is defined using an administrative boundary shapefile (Figure 1, source).
Figure 1. Administrative boundary of Istanbul used to define the area of interest (AOI). All mobility data are spatially clipped to this region and aggregated using the H3 hierarchical grid system.
We spatially discretize the area of interest using the H3 Uber hierarchical indexing at resolution 8. This corresponds to hexagonal cells of approximately 0.737 km². Each H3 cell (or hexagon) represents the spatial unit of the analysis.
1.3 Time window and study periods#
To capture mobility dynamics before, during, and after Republic Day, which occurs from 13.00 on 28 October to all 29 October, we extract data for the period 2 October to 10 November 2023, spatially clipped to the study area (see Figure 2).
We define three analysis periods:
Baseline period: 2-27 October
Event period: 28-29 October
Post-event period: 30 October - 10 November
The extracted dataset consists of approximately 23 million GPS points generated by 242,400 unique users.
Figure 2. Spatial distribution of GPS observations shown as the average number of records per H3 hexagon (resolution 8). Higher values represent a greater concentration of recorded activity. The color scale is log₁₀-transformed, with darker blue tones indicating areas with more observations.
1.4 Preprocessing and filtering#
To ensure data quality and reduce noise, we apply a set of preprocessing steps. Users with very low daily activity are excluded, as they do not provide reliable information on spatial behavior. In particular, we retain users with at least three recorded points per day. As shown in Figure 3, increasing the minimum points-per-day threshold progressively removes low-activity users and reduces sharp spikes driven by users with only one or very few daily points, while preserving broadly consistent temporal trends across filtering levels.
In addition, H3 hexagons are retained only if they are consistently active throughout the observation period. Hexes with insufficient activity are removed to avoid unstable estimates and inflated Z-scores.
The final dataset consists of approximately 22,336,000 observations from 131,500 users covering 1,784 spatial units.
Figure 3. Daily number of active users retained under different minimum points-per-day thresholds (\(\geq\) 1,2,3,5, and 10 points). Increasing the threshold progressively removes low-activity users, reducing sharp spikes driven by users with only one or very few daily points, while preserving temporal trends.
2. Methods#
2.1 Urban Space Usage Index#
To capture spatial activity at a daily resolution, we use the Urban Space Usage Index (\(I\)), defined as the number of unique users visiting each H3 hexagon normalized by the total number of active users on that day:
where \(U_{h,d}\) is the number of unique users in hexagon \(h\) on day \(d\), and \(U_d\) is the total number of active users on day \(d\).
The index can be interpreted as the share of total observed activity occurring in each hexagon. The number of unique users is used as a proxy for human presence, assuming that higher user counts correspond to greater spatial utilization. For more details on the index, please refer to the Methodological Framework.
2.2 Deviation Measurement (Z-score)#
To identify event-driven deviations, we define a baseline period using pre-event days (2-27 October 2023). For each hexagon, we compute the baseline mean and standard deviation of the Urban Space Usage Index.
Deviations from typical conditions are quantified using a Z-score, computed for each hexagon and day as:
where \(\mu_h\) and \(\sigma_h\) are the average and standard deviation of \(I_{h,d}\) during the baseline period, respectively.
The Z-score measures how strongly observed activity deviates from expected baseline levels. Positive values indicate higher-than-expected activity, while negative values indicate lower-than-expected activity relative to baseline conditions. The magnitude (i.e., absolute value) of the Z-score reflects the strength of the deviation, with larger values indicating more pronounced anomalies.
3. Results#
3.1 Temporal evolution of urban activity#
The temporal evolution of the Urban Space Usage Index shows a noticeable shift in urban activity during the event period compared to typical conditions (see Figure 4). Activity levels rise in the days leading up to the event, peak on the event day, and decline shortly thereafter. As expected for a large planned event, this pattern suggests anticipatory and residual mobility effects, likely driven by preparation activities, public gatherings, and increased presence in central areas.
Figure 4. Time series of the average Urban Space Usage Index (I) across all hexagonal cells in Istanbul, with shaded area representing the interquartile range. The highlighted band marks the Republic Day period (28-29 October), during which a clear increase in activity is observed.
3.2 Anomaly detection#
To assess whether changes in activity correspond to statistically significant deviations from typical conditions, we use the Z-score, which measures how many standard deviations observed activity deviates from its baseline.
As shown in Figure 5, during the baseline period, as expected, Z-scores fluctuate around zero, indicating stable activity levels. Two localized positive peaks are observed in mid-October, which may be associated with known protest events and reflect short-lived increases in mobility (see, e.g., EFE, 18 Oct 2023 and Balkan Insight, 20 Oct 2023).
The Republic Day event exhibits a clear increase in Z-score, beginning in the days preceding the event and peaking during the main celebration.
On 28 October, the day preceding the Republic Day main event, the average Z-score rises from -0.2 to 2.13, indicating a positive deviation from baseline conditions and marking the start of increased activity. On the main event day, 29 October (Republic Day), the average Z-score is 4.05, indicating an extreme positive anomaly due to increased activity relative to normal conditions. This peak is markedly higher than all other fluctuations observed in the time series.
On the day following the event, Z-scores remain elevated, likely reflecting continued celebrations extending past midnight and into the following day. After 31 October, the Z-score rapidly declines, returning to values close to zero.
This quick recovery toward typical mobility patterns suggests that the observed temporary disruption is closely associated with the Republic Day event.
Figure 5. Time series of the average Z-score of the Urban Space Usage Index across hexagons in Istanbul, with shaded area representing the interquartile range. Horizontal dashed lines indicate anomaly thresholds. The highlighted band marks the Republic Day period (28-29 October), during which activity reaches extreme positive deviations.
3.3 Spatial distribution of anomalies#
The map in Figure 6 illustrates the spatial distribution of Z-scores across hexagonal cells on 29 October (Republic Day). It shows that the increase in activity is not confined to specific locations but is instead widely distributed across the metropolitan area of Istanbul, including airport areas and major road infrastructure. This pattern confirms that the anomaly is spatially diffuse, affecting both central areas and peripheral zones.
Figure 6. Map of the Z-score of the Urban Space Usage Index across H3 hexagonal cells in Istanbul on 29 October 2023. Warmer colors indicate higher-than-expected activity, showing a widespread increase across the metropolitan area.
We further examine the evolution of Z-scores over the entire observation period. During the baseline phase, Z-scores remain close to zero. On Republic Day, the map becomes uniformly dominated by high values. Following the event, Z-scores rapidly decline, returning to baseline levels consistent with typical mobility patterns (Figure 7).