Published November 6, 2024 | Version v1
Dataset Open

(Un)Fair Process Mining Event Logs (Converted to OCEL)

Description

Converted to OCEL 1.0 JSONOCEL and OCEL 2.0 XML from traditional event logs available at: Zenodo - Record 8059489.

Object Types: { Person }

Person-level Attributes:

  • (int) overallProtected: An attribute (0/1) indicating whether the person has experienced discrimination. (Note: If you're developing a fairness assessment algorithm, only use this attribute in the testing phase!)
  • (int) sumBoolDiscrFactors: Counts the number of possible discrimination factors that apply to the person.
  • (int) reworkedActivities: The total amount of rework involved in the person’s processing.
  • (float) throughputTime: The total processing time for a person.
  • (int) numOcc_ACTIVITY: Counts the number of times an activity occurs in the person’s lifecycle.

Event-level Attributes:

  • resource: The resource involved in processing a given person.

 

* Hiring

The data describes a multifaceted recruitment process with diverse application pathways ranging from minimal processing to extensive multi-step procedures. The variability of these routes, largely dependent on numerous determinants, yields a spectrum of outcomes from instant rejection to successful job offers.

The logs include attributes such as age, citizenship, German proficiency, gender, religion, and years of education. While these attributes may inform candidate profiles, their misuse could engender discrimination. Variables like age and education may signify experience and skills, citizenship and German language may address job logistics, but these should not unjustly eliminate applicants. Gender and religion, unrelated to job performance, must not sway hiring. Therefore, the use of these attributes must uphold fairness, avoiding any potential bias.

* Hospital

The data depicts a hospital treatment process that commences with registration at an Emergency Room or Family Department and advances through stages of examination, diagnosis, and treatment. Notably, unsuccessful treatments often entail repetitive diagnostic and treatment cycles, underscoring the iterative nature of healthcare provision.

The logs incorporate patient attributes such as age, underlying condition, citizenship, German language proficiency, gender, and private insurance. These attributes, influencing the treatment process, may unveil potential discrimination. Factors like age and condition might affect case complexity and treatment path, while citizenship may highlight healthcare access disparities. German proficiency can impact provider-patient communication, thus affecting care quality. Gender could spotlight potential health disparities, while insurance status might indicate socio-economic influences on care quality or timeliness. Therefore, a comprehensive examination of these attributes vis-a-vis the treatment process could shed light on potential biases or disparities, fostering fairness in healthcare delivery.

* Lending

This data illustrates the steps within a loan application process. From an initial appointment request, the process navigates various stages, including information verification and underwriting, culminating in loan approval or denial. Additional steps may be required, such as co-signer enlistment or collateral assessment. Some cases experience outright appointment denial, indicating the process's variability, reflecting applicants' differing credit situations.

The logs' attributes can aid in identifying influences on outcomes and detecting discrimination. Personal characteristics ('age', 'citizen', 'German speaking', and 'gender') and socio-economic indicators ('YearsOfEducation' and 'CreditScore') can impact the process. While 'yearsOfEducation' and 'CreditScore' can validly inform creditworthiness, 'age', 'citizen', 'language ability', and 'gender' should not bias loan decisions, ensuring these attributes are used responsibly fosters equitable loan processes.

* Renting

The data represents a rental process. It begins with a prospective tenant applying to view a property. Subsequent steps include an initial screening phase, viewing, decision-making, and a potential extensive screening. The process ends with the acceptance or rejection of the prospective tenant. In some cases, a tenant may apply for viewing but be rejected without the viewing occurring.

The logs contain attributes that can shed light on potential biases in the process. 'Age', 'citizen', 'German speaking', 'gender', 'religious affiliation', and 'yearsOfEducation' might influence the rental process, leading to potential discrimination. While some attributes may provide useful insights into a potential tenant's reliability, misuse could result in discrimination. Thus, fairness must be observed in utilizing these attributes to avoid potential biases and ensure equitable treatment.

Files

hiring_log_high.xml

Files (233.8 MB)

Name Size Download all
md5:22b0b23e6618a980d7c7a9b862d5bbed
21.0 MB Download
md5:839d12a7f6b97872bf3c82785f196222
34.3 MB Preview Download
md5:defdabe7eddb4cb2945d6d272efadb3a
20.7 MB Download
md5:87a95c507b6086d8b64b3a55202d040f
33.0 MB Preview Download
md5:9891fefa9287c5e77bac2ef82b62dac1
20.6 MB Download
md5:653056c8d1f41bc851402989f668f5a5
33.7 MB Preview Download
md5:5e2f525b06fe218dc443d6437ca00316
27.3 MB Download
md5:3142946b84506198078cfb629094bd9d
43.1 MB Preview Download