(Un)Fair Process Mining Event Logs (Converted to OCEL)
Creators
Description
Converted to OCEL 1.0 JSONOCEL and OCEL 2.0 XML from traditional event logs available at: Zenodo - Record 8059489.
Object Types: { Person }
Person-level Attributes:
- (int) overallProtected: An attribute (0/1) indicating whether the person has experienced discrimination. (Note: If you're developing a fairness assessment algorithm, only use this attribute in the testing phase!)
- (int) sumBoolDiscrFactors: Counts the number of possible discrimination factors that apply to the person.
- (int) reworkedActivities: The total amount of rework involved in the person’s processing.
- (float) throughputTime: The total processing time for a person.
- (int) numOcc_ACTIVITY: Counts the number of times an activity occurs in the person’s lifecycle.
Event-level Attributes:
- resource: The resource involved in processing a given person.
* Hiring
The data describes a multifaceted recruitment process with diverse application pathways ranging from minimal processing to extensive multi-step procedures. The variability of these routes, largely dependent on numerous determinants, yields a spectrum of outcomes from instant rejection to successful job offers.
The logs include attributes such as age, citizenship, German proficiency, gender, religion, and years of education. While these attributes may inform candidate profiles, their misuse could engender discrimination. Variables like age and education may signify experience and skills, citizenship and German language may address job logistics, but these should not unjustly eliminate applicants. Gender and religion, unrelated to job performance, must not sway hiring. Therefore, the use of these attributes must uphold fairness, avoiding any potential bias.
* Hospital
The data depicts a hospital treatment process that commences with registration at an Emergency Room or Family Department and advances through stages of examination, diagnosis, and treatment. Notably, unsuccessful treatments often entail repetitive diagnostic and treatment cycles, underscoring the iterative nature of healthcare provision.
The logs incorporate patient attributes such as age, underlying condition, citizenship, German language proficiency, gender, and private insurance. These attributes, influencing the treatment process, may unveil potential discrimination. Factors like age and condition might affect case complexity and treatment path, while citizenship may highlight healthcare access disparities. German proficiency can impact provider-patient communication, thus affecting care quality. Gender could spotlight potential health disparities, while insurance status might indicate socio-economic influences on care quality or timeliness. Therefore, a comprehensive examination of these attributes vis-a-vis the treatment process could shed light on potential biases or disparities, fostering fairness in healthcare delivery.
* Lending
This data illustrates the steps within a loan application process. From an initial appointment request, the process navigates various stages, including information verification and underwriting, culminating in loan approval or denial. Additional steps may be required, such as co-signer enlistment or collateral assessment. Some cases experience outright appointment denial, indicating the process's variability, reflecting applicants' differing credit situations.
The logs' attributes can aid in identifying influences on outcomes and detecting discrimination. Personal characteristics ('age', 'citizen', 'German speaking', and 'gender') and socio-economic indicators ('YearsOfEducation' and 'CreditScore') can impact the process. While 'yearsOfEducation' and 'CreditScore' can validly inform creditworthiness, 'age', 'citizen', 'language ability', and 'gender' should not bias loan decisions, ensuring these attributes are used responsibly fosters equitable loan processes.
* Renting
The data represents a rental process. It begins with a prospective tenant applying to view a property. Subsequent steps include an initial screening phase, viewing, decision-making, and a potential extensive screening. The process ends with the acceptance or rejection of the prospective tenant. In some cases, a tenant may apply for viewing but be rejected without the viewing occurring.
The logs contain attributes that can shed light on potential biases in the process. 'Age', 'citizen', 'German speaking', 'gender', 'religious affiliation', and 'yearsOfEducation' might influence the rental process, leading to potential discrimination. While some attributes may provide useful insights into a potential tenant's reliability, misuse could result in discrimination. Thus, fairness must be observed in utilizing these attributes to avoid potential biases and ensure equitable treatment.
Files
hiring_log_high.xml
Files
(233.8 MB)
Name | Size | Download all |
---|---|---|
md5:22b0b23e6618a980d7c7a9b862d5bbed
|
21.0 MB | Download |
md5:839d12a7f6b97872bf3c82785f196222
|
34.3 MB | Preview Download |
md5:defdabe7eddb4cb2945d6d272efadb3a
|
20.7 MB | Download |
md5:87a95c507b6086d8b64b3a55202d040f
|
33.0 MB | Preview Download |
md5:9891fefa9287c5e77bac2ef82b62dac1
|
20.6 MB | Download |
md5:653056c8d1f41bc851402989f668f5a5
|
33.7 MB | Preview Download |
md5:5e2f525b06fe218dc443d6437ca00316
|
27.3 MB | Download |
md5:3142946b84506198078cfb629094bd9d
|
43.1 MB | Preview Download |