Voxel modeling and association of ubiquitous spatiotemporal information in natural language texts

ABSTRACT The application of ubiquitous spatiotemporal information extracted from Internet texts to spatiotemporal association and analysis is limited by its unstructured nature and uncertainty. This study uses ST-Voxel modeling to solve the problem of structured modeling and association of ubiquitous spatiotemporal information in natural language texts. It provides a new solution for associating ubiquitous spatiotemporal information on the Internet and discovering public opinion. The main contributions of this paper include: (1) It proposes a convolution method for ST-Voxels, which solves the voxel modeling problem of unstructured and uncertain spatiotemporal objects and spatiotemporal relation in natural language texts. Experiments show that this method can effectively model 5 types of spatiotemporal objects and 16 types of uncertain spatiotemporal relation found in texts; (2) It realizes unknown event discovery based on voxelized spatiotemporal information association. Experiments show that this method can effectively aggregate ubiquitous spatiotemporal information from multiple natural language texts, which is conducive to discovering spatiotemporal events. The selection of convolution parameters in voxel modeling is also discussed, and a parameter selection method for balancing the discovery capability and discovery accuracy of spatiotemporal events is given.


Introduction
The rise of various information-sharing software represented by social media has led to the production of many texts containing spatiotemporal information daily (Yang et al. 2020). This spatiotemporal information, called ubiquitous spatiotemporal information, is enormous in quantity, inconsistent in structure, and uncertain. After the extraction of such ubiquitous spatiotemporal information from text, including geographical names (Hu et al. 2022; Ma and Hovy 2016), spatial relation (Qiu et al. 2019; Stock et al. 2022) and time information (Kim and Myaeng 2004), processing and application of the extracted spatiotemporal information is crucial and challenging.
The processing difficulties mainly come from the following two aspects. The first is the unstructured and uncertain nature of ubiquitous spatiotemporal information. There are different ways to describe the same spatiotemporal area. It may be latitude and longitude and time stamps, place names and time periods, or spatial relation based on a specific address and temporal relation based on a particular time (Cadorel, Blanchi, and Tettamanzi 2021; Tissot et al. 2019). Additionally, the information might be inaccurate. It may contain uncertain descriptive words such as 'around'. Even specific terms, such as 'east side' and '1 km', are generally subjective judgments that have not been accurately measured. Ubiquitous spatiotemporal information therefore describes an uncertain range with vague boundaries, that is, a semantic field.
The existing processing of ubiquitous spatiotemporal information is mainly based on membership functions, and quantitative modeling methods are designed for various spatiotemporal relation (Peuquet and Ci-Xiang 1987; Tissot et al. 2019; Y. Wang et al. 2018; Z. Wang et al. 2018). These methods can effectively obtain the modeling results of ubiquitous spatiotemporal information in the form of coordinates and time scales. However, when analyzing similar spatiotemporal areas recorded in this form, it is still necessary to calculate the spatiotemporal relation to determine whether these coordinates describe the same place. Hence, such modeling has not fundamentally resolved the uncertain nature of the information. Also, the geometry of the fuzzy semantic field is complex, and it is not easy to describe it directly with coordinates or equations.
When it comes to the application step, existing solutions need to access the spatiotemporal information associated with the event (based on event discovery) for positioning and next-step analysis (Bakillah, Li, and Liang 2015; Kuflik et al. 2017; Zhou, Xu, and Kimmons 2015). These methods rely not only on the pre-discovery of the event, but also on searching for information about the event that occurred, such as the timely and correct detection of target events (Sakaki, Okazaki, and Matsuo 2010) and the reporting of relevant news (Devès et al. 2019). In the case of incorrectly related queries, the spatiotemporal information obtained may not be related to the event itself, which easily results in an incorrect analysis of the event. Another disadvantage is that discovering unknown events from spatiotemporal information is difficult.
Aiming at spatiotemporal association of ubiquitous spatiotemporal information, existing methods process the modeling results using clustering (Ansari, Prakash, and Mainuddin 2018). A related index query method has been proposed (Chen et al. 2023), where it is necessary to import many results into the spatiotemporal analysis system to perform association. It is challenging to cope with the theoretically infinite growth of spatiotemporal information.
To solve the processing and association problems, we introduce the spatiotemporal voxel (ST-Voxel), a kind of spatiotemporal geometry that realizes the discrete expression of spatiotemporal regions (J. Li et al. 2022; Smith and Dragićević 2022; Stilla and Xu 2023), based on the following two motivations: (1) When the ST-Voxels determined by the spatiotemporal range are used to represent the ubiquitous spatiotemporal information, the unstructured text is transformed into a structured set of voxels. The ST-Voxels used to represent the spatiotemporal information will be alike for different descriptions of the same spatiotemporal region, so there is no need to judge through complex spatiotemporal relation. Moreover, voxel modeling of fields has been applied to physical fields like magnetic fields (Odawara et al. 2011), and its effectiveness has been confirmed. Therefore, ST-Voxels suit the modeling and expression of spatiotemporal information. (2) When the event associated with the spatiotemporal information cannot be determined in advance, associating the spatiotemporal information with the ST-Voxels causes a large amount of spatiotemporal information to accumulate naturally in a limited number of spatiotemporal voxels, which leads to the discovery of unknown events after the spatiotemporal association. Moreover, studies have shown that the discretization of spatiotemporal data helps extract spatiotemporal information features, which can be used in predictive learning, anomaly detection, and other fields (S. Wang, Cao, and Yu 2022). In addition, the number of voxels used in the voxelization modeling results of ubiquitous spatiotemporal information is limited, and it does not grow without bound as the amount of spatiotemporal information increases. Therefore, the voxelization modeling of ubiquitous spatiotemporal information contributes to its information association and event finding.
Existing ST-Voxel models are generally used to process traditional deterministic spatiotemporal information (Poux and Billen 2019; Stilla and Xu 2023; Xu, Tong, and Stilla 2021), and have limited power to process information that is unstructured and ambiguous. Therefore, it is necessary to expand and supplement the existing ST-Voxel model before it can solve the modeling and application of ubiquitous spatiotemporal information. Our main contributions are: (1) We introduce and expand the ST-Voxel concepts for the processing of unstructured and uncertain ubiquitous spatiotemporal information; (2) We propose an ST-Voxel convolution method to handle the unstructured and uncertain nature of ubiquitous spatiotemporal information. The fuzzy semantic field described by various structures and quantities of ubiquitous spatiotemporal information can be modeled as a limited number of deterministic spatial voxel objects.
(3) The application potential of our proposed method has been demonstrated by the information association method, which also enables unknown event discovery.
The article is organized in the following structure: Section 2 introduces the adaptation of the ST-Voxel model, and details the voxel modeling method for spatiotemporal information; Section 3 presents the voxelized information correlation method for event correlation and spatiotemporal correlation; and Section 4 verifies our method's feasibility and effectiveness.

[...] and the voxel relational operator are proposed for the spatiotemporal objects and the spatiotemporal relation, respectively. The convolved voxel object is then obtained from the voxelized result for the spatiotemporal object. The voxelization modeling results can be finally obtained through the convolution operation of the convolved voxel objects and the voxel relational operators. Figure 1(b) lists 3 examples of spatiotemporal information extracted from texts, which are the input of the voxelized modeling method. The 1st ubiquitous spatiotemporal information 'Eiffel Tower; East; 500 m' is extracted from the text 'we held a party 500 meters from the east side of the Eiffel Tower', and Figure 1(c) is the voxelized modeling result of that information.

Adaption of ST-Voxels
This paper combines spatial position coding (Lei et al. 2020) and time coding (Tong et al. 2019) to expand the concept of ST-Voxels. The spatial range represented by the spatial coding is used as the bottom of the voxel, representing the voxel's spatial range. The time range defined by the time coding is used as the height of the voxel, representing the voxel's time range. The expanded concept of ST-Voxels includes three parts: the coordinate system, the identification, and the attributes.
(1) ST-Voxels have absolute and relative coordinate systems, where the absolute coordinate system is used to identify the absolute position of the ST-Voxels, and the relative coordinate system is used to identify the relative position of each voxel within a given set of ST-Voxels. The absolute coordinate system used in this study adopts the latitude and longitude projection. The E-axis coincides with the equator and points due east. The N-axis coincides with 0° longitude and points due north. The T-axis represents time pointing to the future (generally, a specific reference time is the origin of the T-axis). The three axes form a right-handed rectangular coordinate system ('a' in Figure 2), the scale of which depends on the voxel resolution (in this article, it is also the segmentation scale of the coding). The origin of the relative coordinate system is at the center of a voxel. The I-axis points east, the J-axis points north, and the K-axis is the time axis pointing to the future; together, the I-, J-, and K-axes form a right-handed rectangular coordinate system. The scale of the three axes depends on the voxel resolution.
(2) In this study, a joint code comprising spatial coding (Lei et al. 2020) and temporal coding (Tong et al. 2019) was used to identify ST-Voxels uniquely. The absolute identification of the ST-Voxel in Figure 2 is 6989586621679009791-284606568362672127 ('b' in Figure 2). In a relative reference frame, ST-Voxels can also be identified by three-dimensional coordinates of the form (i, j, k).
(3) The attributes of the ST-Voxels are used to describe the objects, events, information, etc., present in the voxel. The attribute can be deterministic, with 1 or 0 indicating existence or absence. It can also be non-deterministic, represented by a membership degree in [0, 1] as in fuzzy mathematics, which indicates the probability of occurrence of objects, events, and information in the ST-Voxel. The ST-Voxel attribute value shown in Figure 2(a) is 1, which means that the entity is located in that space-time range.
In this study, we use voxels of different colors to distinguish different voxel models. Among them, the gray voxel is the voxelization result of the reference entity when modeling the spatiotemporal relations; the white voxel indicates that the attribute value of the voxel at that location is 0; the blue voxel is the voxelization modeling result of the spatiotemporal information, where the darker the blue, the greater the attribute value of the voxel. Voxels of other specific colors are described in detail when they first appear and will not be repeated here.
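The three parts of the expanded ST-Voxel concept (joint identification, relative coordinates, and attribute) can be pictured with a small data structure. The following is only a minimal sketch: the class name `STVoxel`, its field names, and the assumption that the first half of the joint code is the spatial code and the second half the temporal code are illustrative choices, not part of the coding schemes of Lei et al. (2020) or Tong et al. (2019).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class STVoxel:
    """Hypothetical container for one ST-Voxel (names are illustrative)."""
    spatial_code: int         # integer code of the voxel base (spatial range)
    temporal_code: int        # integer code of the voxel height (time range)
    membership: float = 1.0   # attribute: 1/0 if deterministic, [0, 1] if fuzzy

    def __post_init__(self):
        if not 0.0 <= self.membership <= 1.0:
            raise ValueError("membership must lie in [0, 1]")

    @property
    def identifier(self) -> str:
        # Joint code written as "<spatial>-<temporal>" (assumed order).
        return f"{self.spatial_code}-{self.temporal_code}"


# The identifier quoted for the voxel in Figure 2, split at the hyphen.
voxel = STVoxel(6989586621679009791, 284606568362672127)
print(voxel.identifier, voxel.membership)
```

Freezing the dataclass makes voxels hashable, so they can be used as set members or dictionary keys when voxel sets are later merged.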

Spatiotemporal objects
Location-related spatial information in natural texts has two forms: geographic locations (Monteiro, Davis, and Fonseca 2016) and spatial relation (Stock et al. 2022). There are usually four forms of temporal information in natural texts: time points, time periods, fuzzy time, and temporal relation (Campos et al. 2015). As temporal relation (spatial relation) is based on the other three types of temporal information (geographic locations), time points, time periods, and fuzzy time are collectively referred to as temporal objects and, together with geographic locations, as spatiotemporal objects. Because the time and space in voxels are orthogonal to each other, spatial and temporal information can be modeled separately and combined into a single voxel modeling result. Hence, the voxelized modeling methods for the 5 types of spatiotemporal objects and 16 types of spatiotemporal relation (Allen 1983; Deng, Li, and Wu 2013) listed in Table 1 are discussed separately here.

Geographic locations
The absolute identification of voxels involves a combination of grid and time coding, so geographic location modeling is inseparable from the grid and coding methods of geographic entities (Lei et al. 2020;Peterson 2011;Sahr 2019;Wu et al. 2021).A quadtree-based latitude-longitude grid (Hartmann et al. 2016;Lei et al. 2020) was used to project voxels in space for modeling.The spatial resolution of voxels was related to the application scenario (e.g. for point of interest (POI) data, the resolution was 82 m × 94 m).The attributes of the voxel identified whether the object appeared in the spatiotemporal range, and a value of 1 indicated that the object was located at the voxel position.
Geographic locations include direct coordinates, place names, and other indirect locations (Monteiro, Davis, and Fonseca 2016). Indirect location queries have been extensively studied for geographic information retrieval (Baeza-Yates and Ribeiro-Neto 1999) and are not repeated here. The voxel modeling method for coordinates and place names in a unit time period is introduced below.

2.2.1.1. Coordinates. Geographic locations in the form of coordinates can be uniquely identified by a spatial code and then combined with the corresponding time to produce ST-Voxel modeling results (Figure 3). When fuzzy quantization was not involved, the attribute value of the voxel was 1.
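As a rough illustration of the coordinate case, the sketch below snaps a longitude-latitude pair to a grid cell and combines it with a time code into a single voxel with attribute value 1. It assumes a plain uniform grid with a hypothetical `cells_per_degree` parameter; it does not reproduce the quadtree-based latitude-longitude coding of Lei et al. (2020), and the function names are illustrative.

```python
import math


def spatial_cell(lon, lat, cells_per_degree=1000):
    """Snap a coordinate to an integer (i, j) cell of a uniform grid (assumed)."""
    return math.floor(lon * cells_per_degree), math.floor(lat * cells_per_degree)


def coordinate_voxel(lon, lat, time_code, cells_per_degree=1000):
    """Return {(i, j, k): membership} for a crisp coordinate at one time code."""
    i, j = spatial_cell(lon, lat, cells_per_degree)
    # No fuzzy quantization is involved, so the attribute value is 1.
    return {(i, j, time_code): 1.0}


# Hypothetical coordinate and time code, for illustration only.
result = coordinate_voxel(113.6253, 34.7466, time_code=5653)
print(result)
```

Keying the result by (i, j, k) means that two texts giving slightly different coordinates inside the same cell and time slot map to the same voxel without any further spatial comparison.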
2.2.1.2. Place names. When spatial information as place names was modeled, the following considerations were necessary: (1) when a place name database (Adams 2017; Wiki 2022) was used as auxiliary data, the corresponding longitude and latitude sets were searched by matching place names; and (2) when a network map was used, its API service (like AutoNavi Map API; AMAP 2022) was used to obtain the coordinates of place names. Finally, the spatial modeling results of place names were obtained according to the coordinate modeling method (Figure 4). Among the three types of auxiliary data, the coordinate data from the longitude-latitude place names database were the most accurate, and the modeling results were closest to the actual geographic range of place names. The coordinate data of the grid gazetteer were less accurate than those of the latitude and longitude place name database owing to the influence of the grid, and some voxelization results were omitted or incorrectly increased. The web map API returned latitude and longitude coordinates for place names, and the voxelization result was generally irrelevant to the real geographic shape.

Temporal objects
Normalized temporal structures like 'March 21, 2022' can be directly converted to one-dimensional integer encodings (Tong et al. 2019), so converting the 3 types of temporal objects to a standard format enables their voxelized modeling.
2.2.2.1. Time points. Time information in the form of time points needs to be transformed into a normalized time structure considering the context and then converted into integer codes as the modeling results for time points. The height of the voxels was the time interval represented by the integer code, and the attribute value of the voxel was 1 (Figure 5).

2.2.2.2. Time periods. To model time information in the form of time periods, the start and end time points were obtained and converted into temporal codes by modeling temporal objects in the form of time points. Then, all integer codes in the interval between the two temporal codes under the current time resolution were used as the modeling results for the time periods (Figure 6).
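The enumeration of time codes for a period can be sketched as follows. The integer code here is simply the number of whole time steps since a reference time; this is an assumption made for illustration and is not the multi-scale integer time coding of Tong et al. (2019). The reference time `EPOCH` and the function names are hypothetical.

```python
from datetime import datetime

EPOCH = datetime(2022, 1, 1)  # hypothetical origin of the T-axis


def time_code(t, step_min=30):
    """Integer index of the time step containing t (assumed coding)."""
    return int((t - EPOCH).total_seconds() // (step_min * 60))


def period_codes(start, end, step_min=30):
    """All integer codes from the start code to the end code, inclusive."""
    return list(range(time_code(start, step_min), time_code(end, step_min) + 1))


# '8:00-18:30' on 28 April 2022 at a temporal resolution of 30 min.
codes = period_codes(datetime(2022, 4, 28, 8, 0), datetime(2022, 4, 28, 18, 30))
print(len(codes), codes[0], codes[-1])
```

Each returned code corresponds to one voxel height, so a period of ten and a half hours at 30 min resolution yields 22 stacked voxels over the same spatial base.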
2.2.2.3. Fuzzy time. Firstly, the fuzzy times represented by 'early morning' and 'evening' were converted into time points baseT using a simple mapping method. Then, the membership cloud method (Li et al. 1998) was used to solve its ambiguity. This is done as follows. Let $E_x$ be the smallest scale that is not 0 in baseT (for example, 6 o'clock in 6:00; 24 min in 00:24). For each value pair $\{x_i, y_i\}$ ($i \in 1, 2, \ldots, N$) generated, let $T_i = \text{baseT} + x_i$ (a positive sign for $x_i$ indicates time delay and a negative sign represents time advance) and assign membership degree $y_i$ to the time code corresponding to $T_i$. After summing and normalizing the $N$ membership degrees according to the time code, the value range of each membership degree was from 0 to 1, which was used as the time membership degree of each voxel. If the spatial modeling results had membership degrees, the corresponding membership degrees were multiplied (Figure 7).
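The fuzzy time procedure can be sketched with a standard forward normal cloud generator, which is the usual form of the membership cloud method. Several points below are assumptions rather than details given in the text: the offsets $x_i$ are sampled around zero in units of voxel time steps, the entropy `en` and hyper-entropy `he` are passed in directly instead of being derived from $E_x$ and the convolution parameter, and normalization divides by the largest summed value.

```python
import math
import random
from collections import defaultdict


def normal_cloud(ex, en, he, n, rng):
    """Forward normal cloud generator: returns n drops (x_i, y_i)."""
    drops = []
    for _ in range(n):
        en_i = abs(rng.gauss(en, he)) or en   # perturbed entropy, never 0
        x = rng.gauss(ex, en_i)
        y = math.exp(-((x - ex) ** 2) / (2 * en_i ** 2))
        drops.append((x, y))
    return drops


def fuzzy_time_membership(base_code, en, he, n=1000, seed=0):
    """Map a fuzzy time around base_code to {time_code: membership}."""
    rng = random.Random(seed)
    totals = defaultdict(float)
    for x, y in normal_cloud(0.0, en, he, n, rng):
        totals[base_code + round(x)] += y     # T_i = baseT + x_i
    peak = max(totals.values())               # assumed normalization
    return {code: value / peak for code, value in totals.items()}


# Hypothetical example: 'evening' mapped to time code 5652, spread of 2 steps.
membership = fuzzy_time_membership(5652, en=2.0, he=0.2)
print(sorted(membership.items())[:3])
```

The seed is fixed only to make the sketch repeatable; in practice the number of drops and the spread would follow from the chosen convolution parameter.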

Spatiotemporal relation
The spatiotemporal relation can determine the spatiotemporal location of an object through the location of another object and their relation. Therefore, modeling should also follow this rule: the reference object and the spatiotemporal relation are modeled separately, and the spatiotemporal position of the target object is obtained through some combination of these. For different reference objects having the same spatiotemporal relation, no matter how the reference objects change, the target spatiotemporal position can be obtained from the same modeling results for the spatiotemporal relation. This is similar to the concepts of system input and impulse response in convolution. At the same time, in the ST-Voxel relative coordinate system, the position of each voxel is uniquely identified by three-dimensional integer coordinates, which is suitable for linear operations. The convolution operation of ST-Voxels is defined here to realize the modeling of spatiotemporal relation.
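The input and impulse-response analogy can be made concrete with an ordinary discrete convolution on a single time slice. The sketch below is a plain two-dimensional 'same'-size convolution written with NumPy, not the paper's exact ST-Voxel operator; the function name, the toy matrices, and the convention that a column offset of +1 means one voxel to the east are assumptions for illustration.

```python
import numpy as np


def voxel_convolve(obj, op):
    """'Same'-size 2-D convolution of an object matrix with an odd-sized operator."""
    kh, kw = op.shape
    ph, pw = kh // 2, kw // 2
    padded = np.pad(obj.astype(float), ((ph, ph), (pw, pw)))
    flipped = op[::-1, ::-1]                  # convolution flips the kernel
    out = np.zeros(obj.shape, dtype=float)
    for r in range(obj.shape[0]):
        for c in range(obj.shape[1]):
            out[r, c] = np.sum(padded[r:r + kh, c:c + kw] * flipped)
    return out


# Reference object: a single voxel at row 2, column 2 of a 5 x 5 slice.
reference = np.zeros((5, 5))
reference[2, 2] = 1.0

# Toy relation operator: full membership one column to the right of its center.
operator = np.zeros((3, 3))
operator[1, 2] = 1.0

target = voxel_convolve(reference, operator)
print(target)
```

Because the operator is stamped at every voxel of the reference object, the same operator can be reused for any reference object that shares the relation; with several reference voxels the summed values may exceed 1 and would need normalizing.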

Mathematical principles of ST-Voxel convolution
For real space $\mathbb{R}^{m \times n \times l}$ with $m$ rows, $n$ columns, and [...] where: A relative coordinate system can be established with the geometric center of the voxel set $V = \{V_n \mid n \in \mathbb{N}^+\}$ as the origin and with directions parallel to the absolute coordinate system. The scale of the relative coordinate system was the resolution of the voxels. [...]), wherein element $B_{i,j,k}$ was the attribute value of the voxel whose coordinates were ([...] one-to-one, then the attribute value of the voxel is called the spatiotemporal membership degree, which indicates the probability of the object appearing at this position. We call $V$ the convolved (voxel) object, $B$ the (voxel) object matrix, $C$ the (voxel) relation operator, and $V'$ the (voxel) convolution result. Figure 8 presents an intuitive schematic of the voxel convolution process.

[...] relation modeling is to obtain the geographic locations of a target entity through the reference object's geographic locations and their spatial relation.

Spatial relation
During spatial relation modeling, the convolved object was obtained from the reference object, and the relational operator was generated according to the spatial relation (because the spatial relation does not involve time, only two scales of the relation operator plane are discussed).The convolution result was processed to obtain the inference result for the spatiotemporal position of the target entity.
2.3.2.1. Generation of convolved objects. Directional words (such as 'east' and 'south') in the text not only express the directional relation of the target entity but also restrict the reference entity. For example, for the description 'The restaurant is 500 m east of the school,' 'east' refers to the directional relation between the 'restaurant' and 'school', while '500 m' refers to the approximate distance between the 'restaurant' and 'the part of the school located in the east,' rather than the distance between 'restaurant' and 'the geometric center of the school'. Therefore, the convolved object was obtained by dividing the voxel modeling results of the reference object according to the directional relation and then recombining them. This reduced the workload of subsequent convolution and made the modeling results of the spatial information more accurate.
Let the voxelized result of the reference object be $V_R = \{V_n \mid n = 1, 2, \ldots, N\}$, and let the geometric center of $V_R$ be $(I, J, K)$.
[...] is the rounding symbol). Then, the final convolved object $V$ was obtained according to formula (2), named North Voxel (NV), Northeast Voxel (NEV), East Voxel (EV), Southeast Voxel (SEV), South Voxel (SV), Southwest Voxel (SWV), West Voxel (WV), Northwest Voxel (NWV), and Boundary Voxel (BV). If $V_R$ was observed as in Figure 9(a), then $V$ was defined as in the dark part of Figure 9.

2.3.2.2. From spatial relation to relational operators and convolution operations. Because of the inaccuracy of personal perceptions and descriptions, spatial relation typically has some ambiguity. This study used a random number (pan-normal) generator with a stable tendency (Li et al. 1998) to quantify ambiguous qualitative spatial relation. According to the characteristics of the spatial relationship, this paper uses the expectation and the convolution parameter to calculate the other parameters in this method. The higher the uncertainty of the spatial relation, the greater the convolution parameter.
The relational operator generation process and convolution operation for three types of basic spatial relation and the complex relation formed by combining basic relation are described. When the distant relation is not included in the text, the size of the relational operator is generally an integer multiple of the diameter of the circumcircle of the convolved object, and it is an odd number.
Topological relation: This is of three types: adjacent, intersecting and separated. As the topological relation is generally constant, a fixed relational operator was used for modeling. Table 2 shows an example of the modeling process for topological relation. $B_1$ and $B_2$ in Table 2 are 3 × 3 erosion and dilation structuring elements, respectively (Castleman 1996).
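For readers unfamiliar with the structuring elements mentioned above, the following sketch shows binary erosion and dilation with a full 3 × 3 structuring element on one time slice, using NumPy only. How Table 2 combines these operations for each topological relation is not reproduced here; treating the dilated ring around the object as an 'adjacent' region is an illustrative assumption, and the function names are hypothetical.

```python
import numpy as np


def dilate(mask):
    """Binary dilation with a full 3 x 3 structuring element."""
    padded = np.pad(mask, 1)                  # pads with False
    out = np.zeros_like(mask)
    for dr in range(3):
        for dc in range(3):
            out |= padded[dr:dr + mask.shape[0], dc:dc + mask.shape[1]]
    return out


def erode(mask):
    """Binary erosion with a full 3 x 3 structuring element."""
    padded = np.pad(mask, 1)
    out = np.ones_like(mask)
    for dr in range(3):
        for dc in range(3):
            out &= padded[dr:dr + mask.shape[0], dc:dc + mask.shape[1]]
    return out


# Reference object: a 3 x 3 block of voxels inside a 7 x 7 slice.
reference = np.zeros((7, 7), dtype=bool)
reference[2:5, 2:5] = True

interior = erode(reference)                   # voxels strictly inside the object
ring = dilate(reference) & ~reference         # assumed 'adjacent' candidates
print(int(interior.sum()), int(ring.sum()))
```

Because the operators are fixed, the same two functions apply to any reference object, which matches the observation that the topological relation needs no uncertainty parameter.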
Directional relation: A polar coordinate system was established with due east as the positive direction of the polar axis and the center of the relational operator as the origin. Let $E_x$ be the radian of the direction in the polar coordinate system (for example, due east corresponds to $E_x = 0$, and due north corresponds to $E_x = \pi/2$). For each value pair $\{x_i, y_i\}$ ($i \in 1, 2, \ldots, N$) generated, a relational operator $C_i$ was obtained, and $y_i$ was assigned to all positions where it intersected rays with radians $x_i$. The relational operator of the directional relation was obtained by normalizing the sum of the $N$ relational operators. The relation operator construction process for the directional relation is shown in Figure 10. The modeling process for the directional relation is shown in Figure 11. The gray part of Figure 11 represents the convolved object.
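A directional operator of this kind can be approximated as follows. The sketch samples angles with a normal cloud generator, marches along each ray in half-voxel steps, and adds $y_i$ to every cell the ray passes through. The half-step ray marching, the exclusion of the center cell, the normalization by the maximum, and the parameters `en` and `he` are assumptions made for illustration; row 0 is taken to be the northern edge.

```python
import math
import random
import numpy as np


def direction_operator(size, ex, en, he, n=500, seed=0):
    """Fuzzy directional operator on a size x size grid (size odd)."""
    rng = random.Random(seed)
    c = size // 2
    total = np.zeros((size, size))
    for _ in range(n):
        en_i = abs(rng.gauss(en, he)) or en
        x = rng.gauss(ex, en_i)                       # sampled direction (rad)
        y = math.exp(-((x - ex) ** 2) / (2 * en_i ** 2))
        hit = set()
        for step in range(1, 2 * size):               # half-voxel steps
            r = 0.5 * step
            col = c + round(r * math.cos(x))
            row = c - round(r * math.sin(x))          # north is up
            if not (0 <= row < size and 0 <= col < size):
                break
            if (row, col) != (c, c):                  # skip the origin cell
                hit.add((row, col))
        for row, col in hit:
            total[row, col] += y
    return total / total.max()


east = direction_operator(size=7, ex=0.0, en=0.3, he=0.03)
print(np.round(east[3], 2))                           # middle row, west to east
```

A larger `en` widens the fan of cells that receive non-zero membership, which is how a more uncertain directional description would be expressed in this sketch.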
Distant relation: Let $E_x$ be the number of voxels corresponding to the distance at the voxel spatial resolution during modeling (for example, when the spatial resolution of the voxel was 20 m × 20 m, 500 m corresponded to $E_x = 25$). For each value pair $\{x_i, y_i\}$ ($i \in 1, 2, \ldots, N$) generated, a relational operator $C_i$ with $2[x_i] + 1$ rows and columns was obtained, and $y_i$ was assigned to all positions in the relational operator that intersected the circle whose center was the center of $C_i$ and whose radius was $x_i$. The relational operator of the distant relation was obtained by normalizing the sum of the $N$ relational operators. The relation operator construction process for the distant relation is shown in Figure 12. The modeling process of the distant relation is shown in Figure 13.
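The distance operator can be sketched in the same style. Radii are sampled with a normal cloud generator, and each $y_i$ is added to the cells lying within half a voxel of the sampled circle, which stands in for the cells the circle intersects. Using one common grid sized by the largest sampled radius (instead of summing operators of different sizes), the half-voxel tolerance, and normalization by the maximum are assumptions of this sketch.

```python
import math
import random
import numpy as np


def distance_operator(ex, en, he, n=500, seed=0):
    """Fuzzy ring-shaped operator for a distance of ex voxels."""
    rng = random.Random(seed)
    drops = []
    for _ in range(n):
        en_i = abs(rng.gauss(en, he)) or en
        x = rng.gauss(ex, en_i)                       # sampled radius (voxels)
        y = math.exp(-((x - ex) ** 2) / (2 * en_i ** 2))
        drops.append((abs(x), y))
    half = max(round(x) for x, _ in drops)            # common operator half-size
    rows, cols = np.indices((2 * half + 1, 2 * half + 1))
    dist = np.hypot(rows - half, cols - half)         # distance to the center
    total = np.zeros(dist.shape)
    for x, y in drops:
        total[np.abs(dist - x) <= 0.5] += y
    return total / total.max()


# Hypothetical example: 100 m at a 20 m resolution gives ex = 5 voxels.
ring = distance_operator(ex=5, en=0.5, he=0.05)
center = ring.shape[0] // 2
print(ring.shape, round(float(ring[center, center + 5]), 2))
```

Convolving a reference object with such a ring places the membership at roughly the stated distance in every direction; intersecting it with a directional result then narrows it to one side.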
Other spatial relation: This can be regarded as a complex relation formed through a combination of the three types of basic spatial relation. The basic spatial relation that comprised it were modeled, and the intersection of the results according to equation (3) was the modeling result for other spatial relation.
where SCode, TCode, and $d$ are the spatial coding, temporal coding, and membership degree, respectively; $V'_1$ and $V'_2$ are the modeling results of two basic spatial relation; and $\cap$ is the intersection operator. Figure 14 shows the modeling process of a compound relation comprising directional and distant relation.

Temporal relation
The modeling method of 9 types of temporal relation (during; contains; overlaps; overlapped-by; start; started-by; finished; finished-by; equals) is the one-dimensional form of the topological relation modeling method. Table 3 shows an example of the modeling process for those temporal relation (Figure 15).
Another 4 types of temporal relation (meets; met-by; before; after) are modeled in the same way as the directional and distant relation. While modeling, the convolved object was obtained from the reference temporal object, and the relation operator was generated according to the one-dimensional time span. Let $E_x$ be the number of voxels corresponding to the time span at the voxel time resolution, with the sign of $E_x$ being negative in the direction of time progression. For each value pair $\{x_i, y_i\}$ ($i \in 1, 2, \ldots, N$) generated, a relational operator $C_i$ with $2[|x_i|] + 1$ rows and 1 column was obtained as shown in equation (4). The relational operator of the temporal relation was obtained by normalizing the sum of the $N$ relational operators. The relation operator construction for the temporal relation 'after' is shown in Figure 16.
The modeling process of the temporal relation 'after' is shown in Figure 17.
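The one-dimensional case can be illustrated with `numpy.convolve`. In the sketch below the timeline is an array indexed by time step, and an operator entry at a positive offset from the operator's center moves membership to a later step; this sign convention, the hand-written operator weights, and the function name are assumptions for illustration and may differ from the convention of equation (4).

```python
import numpy as np


def temporal_convolve(timeline, operator):
    """Same-length 1-D convolution of a temporal object with a relation operator."""
    return np.convolve(timeline, operator, mode="same")


# Reference temporal object: one voxel at time step 3 on a 10-step timeline.
timeline = np.zeros(10)
timeline[3] = 1.0

# Toy 'after' operator of length 5 (center index 2): membership 0.5 one step
# later and 1.0 two steps later; nothing at or before the reference time.
after = np.array([0.0, 0.0, 0.0, 0.5, 1.0])

result = temporal_convolve(timeline, after)
print(result)
```

In a full model the operator weights would come from the summed and normalized cloud drops rather than being written by hand.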

Voxelized information association
Voxelized information will naturally aggregate in voxels. The voxelized information association is essential for event association and spatiotemporal association. The voxelized information association is based on the ST-Voxel union method shown in equation (5) and the ST-Voxel intersection method shown in equation (3).
where SCode, TCode, and $d$ are the spatial coding, temporal coding, and membership degree, respectively; $V'_1$ and $V'_2$ are the modeling results of two pieces of ubiquitous spatiotemporal information; and $\cup$ is the union operator. Owing to the boundedness of ST-Voxels, the intersection and union of a large number of spatiotemporal information modeling results were also bounded, which resolved the problem associated with infinitely increasing spatiotemporal information with an increasing number of objects.
The ST-Voxel union obtained spatiotemporal hot spots through the aggregation of the modeling results. The ST-Voxel intersection was used to extract ubiquitous texts associated with some voxels. On the one hand, associating the information of a discovered event makes it possible to analyze its features. On the other hand, associating the spatiotemporal information in a region makes it possible to find and predict events.
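Union and intersection of voxelized results can be sketched with dictionaries keyed by (SCode, TCode). The rules for combining membership degrees are assumptions here, since equations (3) and (5) are not reproduced in this text: the union below accumulates the degrees of coincident voxels, and the intersection keeps the smaller degree of voxels present in both results.

```python
def voxel_union(a, b):
    """Merge two voxel sets; coincident voxels accumulate membership (assumed)."""
    out = dict(a)
    for key, d in b.items():
        out[key] = out.get(key, 0.0) + d
    return out


def voxel_intersection(a, b):
    """Keep voxels present in both sets with the smaller membership (assumed)."""
    return {key: min(d, b[key]) for key, d in a.items() if key in b}


# Two toy modeling results keyed by (SCode, TCode); the codes are hypothetical.
v1 = {(101, 7): 0.5, (102, 7): 1.0}
v2 = {(102, 7): 0.25, (103, 7): 0.5}

merged = voxel_union(v1, v2)
common = voxel_intersection(v1, v2)
hottest = max(merged, key=merged.get)
print(merged, common, hottest)
```

Whatever combination rule is used, the key set of the result can never be larger than the number of voxels in the modeled region, which is the boundedness property noted above.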

Event association
When an event is discovered, a large amount of social media text related to the event is obtained using the event name. The association result of the event's spatiotemporal information is obtained from the union of the modeling results of the spatiotemporal information in the text. The spatiotemporal range corresponding to the ST-Voxel with the largest membership is the location where the event is most likely to have occurred. Figure 18 shows the process of associating the spatiotemporal semantics of the same event description in two sentences. First, the spatiotemporal information in the two sentences is modeled separately to obtain the described positions. Then the union of the two modeling results is taken, and the attributes of the voxels determine the spatiotemporal location of the event. The greater the membership of the voxel, the darker the color of the corresponding voxel, indicating a greater probability that the event happened in that spatiotemporal location.

Spatiotemporal association
When an event occurs or is about to occur but has not been discovered, it is impossible to retrieve relevant information based on the event name.As the spatiotemporal properties of events are unique, the spatiotemporal descriptions of events made by event-related individuals naturally form aggregates in ST-Voxels.Therefore, if the spatiotemporal information in all texts in the time range (Figure 19(a)) is modeled and incorporated into the voxelized spatiotemporal space (Figure 19(b)), the potential events and spatiotemporally related texts can be identified as the ST-Voxels with comparatively higher membership (Figure 19(c)).For example, in the box above Figure 19(c), the time described in all statements is noon, and the spatial location is near Zhengzhou Library.Then, an event can be confirmed and further recognized using other information (such as attributes and relationships) of the related text in space and time.
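The aggregation step described above amounts to counting how many distinct texts fall into each voxel and keeping the voxels above a threshold. The sketch below assumes each text has already been voxelized into a set of voxel keys; the function name, the toy keys, and the threshold are hypothetical.

```python
from collections import defaultdict


def find_hotspots(text_voxels, min_texts):
    """Return {voxel_key: [text ids]} for voxels shared by >= min_texts texts."""
    texts_per_voxel = defaultdict(set)
    for text_id, voxels in text_voxels.items():
        for key in voxels:
            texts_per_voxel[key].add(text_id)
    hot = {key: sorted(ids) for key, ids in texts_per_voxel.items()
           if len(ids) >= min_texts}
    # Voxels with the most associated texts come first.
    return dict(sorted(hot.items(), key=lambda item: -len(item[1])))


# Toy input: text id -> voxel keys (spatial code, time code) it was modeled to.
text_voxels = {
    "t1": {("lib", 24), ("lib", 25)},
    "t2": {("lib", 24)},
    "t3": {("lib", 24), ("park", 30)},
    "t4": {("park", 30)},
}

hotspots = find_hotspots(text_voxels, min_texts=3)
print(hotspots)
```

The returned text identifiers are the spatiotemporally related texts for each hot voxel, which can then be inspected for further attributes and relationships of a candidate event.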

Experimental setups
The following three experiments were carried out to validate our proposed method.
Spatiotemporal semantic modeling feasibility experiment: Voxelized modeling of various forms of spatiotemporal information.The modeling results were used to verify the feasibility of the modeling method.
Voxelized information association experiment: By modeling a large amount of spatiotemporal information in a region, this experiment verifies that the voxelized information association method can be used to discover unknown events.
Modeling parameter comparison experiment: The convolution parameter that affects uncertainty during modeling is usually related to the distant relation. For example, in the text 'Restaurant 100 meters east of the supermarket', when the convolution parameter is set to 1/3, the modeling result range is about [60 m, 140 m]; when the convolution parameter is set to 1, the modeling result range is about [0 m, 300 m]. In this experiment, 6 convolution parameters with an interval of 1/6 were selected for comparison experiments: 1/6, 1/3, 1/2, 2/3, 5/6, and 1. The influence of different convolution parameters on the modeling and association process is explained by comparing the spatiotemporal information modeling with the association results. A parameter selection method that balances the capability of spatiotemporal event discovery and the accuracy of discovery is also presented.

Data and the environment
The source of the experimental data was the 'Notice on 5 Newly Confirmed COVID-19 Cases and 10 Newly Asymptomatic Infections in Zhengzhou City', issued by Zhengzhou, Henan, China from 1 to 10 May 2022, for 189 COVID-19 cases (Announcement 2022b). The content of the notification was the time and space trajectories of each patient. Its form is 'something was done at a certain time and place'. For example, 'On April 28th, 8:00-18:30 in the workplace, go shopping at Xinsheng Fruit Hypermarket at 18:41'. The spatiotemporal information in these data has apparent characteristics and includes various forms of spatiotemporal information, such as place names, spatial [...]

In this study, the spatial information in the text was obtained using the GitHub open-source library pyltp (HuangFJ 2020), and the latitudes and longitudes corresponding to the place names were obtained using the AutoNavi Map API service (AMAP 2022). The temporal information in the text was obtained from the GitHub open-source library pyunit-time (jtyoui 2021).
The location data in the notification were mostly store-level POI data, and the time data were generally time points or time periods accurate to the minute. Therefore, ST-Voxels with a spatial resolution of 82 m × 94 m and a temporal resolution of 30 min were used to model the spatiotemporal information. In the city-level application scenario, the spatial resolution was ignored to eliminate the orientation deviation caused by size differences, and the temporal resolution was suitable for the voxelization of temporal information on the time scale of several days. The voxel resolution was also in line with the approximate spread range of the infectious disease, which was reasonable.
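The discretization described above can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that positions have already been projected to metric offsets (east, north) from an arbitrary local origin, that the 82 m edge runs east-west and the 94 m edge north-south, and that time is counted from a hypothetical epoch of 1 May 2022 00:00. The names `to_st_voxel` and `period_to_voxels` are illustrative only.

```python
import math
from datetime import datetime, timedelta

# Assumed voxel geometry: 82 m (east-west) x 94 m (north-south) x 30 min.
CELL_EAST_M = 82.0
CELL_NORTH_M = 94.0
TIME_STEP = timedelta(minutes=30)
EPOCH = datetime(2022, 5, 1)  # hypothetical time origin


def to_st_voxel(east_m, north_m, when):
    """Map a metric offset and a timestamp to integer voxel indices (i, j, k)."""
    i = math.floor(east_m / CELL_EAST_M)
    j = math.floor(north_m / CELL_NORTH_M)
    k = (when - EPOCH) // TIME_STEP  # floor division of two timedeltas gives an int
    return i, j, k


def period_to_voxels(east_m, north_m, start, end):
    """Voxels covered by a place during the half-open period [start, end)."""
    i, j, k_start = to_st_voxel(east_m, north_m, start)
    k_stop = math.ceil((end - EPOCH) / TIME_STEP)
    k_stop = max(k_stop, k_start + 1)  # a time point still occupies one voxel
    return [(i, j, k) for k in range(k_start, k_stop)]


# A time point falls into a single voxel.
print(to_st_voxel(100.0, 200.0, datetime(2022, 5, 2, 12, 58)))  # (1, 2, 73)

# A one-hour period spans two 30-minute voxels at the same location.
print(period_to_voxels(100.0, 200.0,
                       datetime(2022, 5, 1, 20, 30),
                       datetime(2022, 5, 1, 21, 30)))  # [(1, 2, 41), (1, 2, 42)]
```

Under these assumptions a time point accurate to the minute always maps to exactly one temporal index, while a time period maps to a run of consecutive indices.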

Experimental results
The modeling results of the typical spatiotemporal semantics of 4 sentences are shown in Figure 21. The darker the color of the voxel in the figure, the greater its membership degree.
Figure 22(a) shows the ST-Voxel visualization results for more than 5 overlapping cases in the event discovery experiment. The color of the voxels represents the number of coincident cases: the darker the color, the greater the number. Due to the limited association of space and time of the patients, the voxel distribution was relatively discrete. Figure 22(b) lists the spatial scope, temporal scope and spatiotemporal-related cases in one voxel with the highest attribute value.
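The counting behind this kind of association result can be sketched in a few lines. The sketch assumes that each case's trajectory has already been voxelized into a set of ST-Voxel indices, and it ignores membership degrees by treating a voxel as simply occupied or not. The function name `find_hot_voxels` and the toy data are hypothetical.

```python
from collections import defaultdict


def find_hot_voxels(case_voxels, min_cases=6):
    """Return {voxel: sorted case ids} for voxels shared by at least min_cases cases.

    case_voxels maps a case id to the set of (i, j, k) voxels on its trajectory.
    The default threshold of 6 corresponds to "more than 5 overlapping cases".
    """
    cases_in_voxel = defaultdict(set)
    for case_id, voxels in case_voxels.items():
        for voxel in voxels:
            cases_in_voxel[voxel].add(case_id)
    return {
        voxel: sorted(cases)
        for voxel, cases in cases_in_voxel.items()
        if len(cases) >= min_cases
    }


# Toy data: cases A and B share one voxel, case C is isolated.
toy = {
    "A": {(1, 2, 3), (1, 2, 4)},
    "B": {(1, 2, 4)},
    "C": {(9, 9, 9)},
}
print(find_hot_voxels(toy, min_cases=2))  # {(1, 2, 4): ['A', 'B']}
```

Using a set per voxel means a case that visits the same voxel several times is still counted once.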
Figure 23 visualizes the comparative results of different modeling parameters. The left side shows all the spatiotemporal voxels in this area. On the right is the spatiotemporal range of the central spatiotemporal voxel and the spatiotemporal-related cases.

Spatiotemporal information association experiment
This experiment found a spatiotemporal gathering area of 22 cases: '10:00-17:00 Qiantang Yicheng, April 30, 2022'. Manual screening and verification confirmed that the spatiotemporal trajectories of all of these patients overlapped in a certain ST-Voxel, while the trajectories of other patients did not overlap there. This verified the feasibility of voxelized information association for discovering the unknown event of an infection chain. The statistical results of the epidemic situation at this geographic location showed no further aggregation of cases after 19:00 on 2 May 2022, from which it was inferred that the place had been designated a separate management area. A manual review of the relevant information determined that the area was zoned on May 3 to control the spread of the epidemic (Announcement 2022a). This is consistent with the conclusion obtained from the ST-Voxels and verifies the feasibility of discovering the unknown event of control measures through the aggregation of spatiotemporal information in the voxels.

Comparative experiment of modeling parameters
When 1 and 5/6 (Figure 23(a,b)) are used as convolution parameters for modeling, the modeling results with fuzzy spatiotemporal information have the largest range, and spatiotemporal information is most likely to form aggregations in ST-Voxels. Although more spatiotemporal hot spots can be found, the spatiotemporal range of the hot spots is the largest. Irrelevant spatiotemporal information will also be aggregated, resulting in low credibility of the hot spots. These parameters are suitable for modeling spatiotemporal data with poor reliability.
When 2/3 and 1/2 (Figure 23(c,d)) are used as convolution parameters for modeling, the range of modeling results with fuzzy spatiotemporal information is smaller. Although fewer spatiotemporal hot spots are discovered, their range is smaller and their credibility is higher.
When 1/3 and 1/6 (Figure 23(e,f)) are used as convolution parameters for modeling, the range of modeling results with fuzzy spatiotemporal information is the smallest. It is more difficult for spatiotemporal information to aggregate in ST-Voxels, but the spatiotemporal range of hot spots is the most accurate. The reliability of hot spots is the highest, and these parameters are suitable for high-reliability spatiotemporal data modeling.
Therefore, for data with reliability similar to that of the data used in this study, such as official reports, 1/3 can be used as the convolution parameter for the voxel modeling of spatiotemporal information and spatiotemporal association. If the data are less credible than the data used in this study, such as social media data, convolution parameters of 2/3, 1, or greater can be used for modeling to facilitate the spatiotemporal association of information.
Different events can be obtained using spatiotemporal data from different sources. For example, for the epidemic notification data used in this experiment, the spatiotemporal intersections of the cases were identified according to the spatiotemporal aggregation. Discovering possible transmission chains provided a temporal and spatial basis for tracing the origin of the epidemic and delineating epidemic prevention and control areas. For temporal and spatial data in communication texts, the areas that users focus on and their activity trajectories can be identified from the spatiotemporal aggregation. Combining these data with other attributes of the corresponding locations can then provide a reference for user identification, personalized recommendations, etc.

Conclusions and outlook
This study expands the existing ST-Voxel concept and designs a voxelization modeling method for spatiotemporal information in natural language texts. The modeling of 5 types of spatiotemporal objects and 16 forms of relation is presented separately. In particular, a set of modeling ideas similar to convolution operations is proposed for spatiotemporal relation in voxelized space-time. Unknown event discovery based on voxelized information association is realized. Finally, experiments are designed to verify the feasibility of the modeling method and its application. The influence and selection principle of the model parameters are also analyzed. Motivated by the application results, we will further extend 3D space-time to 4D space-time to process 3D spatial data and apply the information for spatiotemporal aggregation in different scenarios.

Disclosure statement
No potential conflict of interest was reported by the author(s).
Program of Henan Province) under Grant number 221100211000-03, and The National Key R&D Plan of China (2018YFB0505304).

Figure 1
Figure 1(a) illustrates the proposed voxelized modeling process for ubiquitous spatiotemporal information. For extracted spatiotemporal information from texts, the object voxelized model

Figure 9. Example of reference object voxelization result and final convolved object. (a) The reference entity voxelization results are represented by gray, yellow, and green, where the yellow voxel's relative coordinates are (I_l, J_t, K), and the green voxel's relative coordinates are (I_r, J_d, K).

Figure 10. Relational operator construction process for the directional relation.

Figure 11. Modeling process for the directional relation.

Figure 12. Relational operator construction process for the distant relation.

Figure 13. Modeling process for the distant relation.

Figure 15. Convolution with B_1, B_2 in one dimension.

Figure 16. Relational operator construction for the temporal relation 'after'.

Figure 17. Modeling process of the temporal relation 'after'.
relation, time points, time periods, and temporal relation. An example of the original (Figure 20(a)) and the translated (Figure 20(b)) text of the notification is shown below. The annotated part is the text containing spatiotemporal information.

Figure 19. Event discovery process based on spatiotemporal association.

Figure 20. Notification example. (a) Original text of the notification; (b) Translated text of the notification.

Figure 21. Visualization of spatiotemporal information modeling. (a) Distant relation and time period. (b) Place name and time point; coordinate and temporal relation 'after'. (c) Directional relation, distant relation and fuzzy time.
spatiotemporal voxel and the spatiotemporal-related cases. As the convolution parameters decrease (a-f), all voxels shrink, and the related cases increase.

Discussions

Feasibility experiment of spatiotemporal semantic modeling
The proposed voxel modeling method can model the 5 types of spatiotemporal objects and 16 forms of spatiotemporal relation contained in the original data. The modeling results of the topological relation 'near the park' and the time period 'May 1st 20:30-21:30' in 'Eating at a barbecue stall near the park from 20:30-21:30 on May 1st' are multiple ST-Voxels (Figure 21(a)); the modeling results of the place name 'community Xikelai Life Supermarket' and the time point 'May 2nd, 12:58' in 'May 2nd, 12:58 to go shopping at the community Xikelai Life Supermarket' are a single ST-Voxel (Figure 21(b)); the modeling results of the coordinates (113.66°E, 34.71°N) and the temporal relation 'after 2 hours' in 'After 2 hours, go to the cell nucleic acid detection point (113.66°E, 34.71°N) for nucleic acid detection' are a single ST-Voxel (Figure 21(b)); the modeling results of the directional relation 'south', distant relation '500 m', and fuzzy time 'around 22:10' in 'Around 22:10, go to 500 meters south of the entrance of Changjiang Garden Community to buy medicine' are multiple ST-Voxels (Figure 21(c)).

Figure 22. Visualization and statistical results of voxelized information association. (a) ST-Voxel visualization results; (b) Coincidence relative information.

Table 1. Ubiquitous spatiotemporal information types in texts.
Spatiotemporal objects | Spatial objects | Coordinates; place names
Spatiotemporal objects | Temporal objects | Time points; time periods; fuzzy time
Spatiotemporal relation | Spatial relation | Topological relation; directional relation; distant relation
Spatiotemporal relation | Temporal relation | Before; after; during; contains; overlaps; overlapped-by; meets; met-by; start; started-by; finished; finished-by; equals

Table 2. Example of the topological relation modeling process.

Table 3. Example of the modeling process for 9 types of temporal relation.