ECML/PKDD-2005 Workshop on Discovery ChallengeGuide to the Hepatitis Dataset Shusaku Tsumoto Shoji Hirano email@example.com firstname.lastname@example.org Department of Medical Informatics, Faculty of Medicine,Shimane University, Japan
Hepatitis Dataset: Basic Information • Donators • Dr. Katsuhiko Takabayashi and Dr. Hideto Yokoi Department of Medical Informatics, Chiba University Hospital, Japan • Subjects • 771 patients of type B and C chronic virus hepatitis who took examinations at Chiba University Hospital during 1982 and 2001. • Contents • Biopsy information: dates and results • stage of fibrosis, virus activity • Period of applying interferon therapy • Laboratory Examinations • Blood tests (Biochemical, Hematological, DNA), Urinalysis • Basic information of patients • MID (masked ID), Sex, Date of Birth
Hepatitis Dataset: Structure • 7 tables in CSV format • An MID (masked ID) is used to relate information in different table Temporal data Stationary data Basic Patient Information (771) In-hospital Exam Results (1,565,876) MID Interferon AdministrationInformation (198) Out-hospital Exam Results(31,040) Biopsy Information (961) Results of Hematological Analysis (42,914) * Description of Examinations (459) *Added on July 4, 2003.
Goals of the Challenge • Evaluate whether the laboratory examinations can be used to estimate the stage of liver fibrosis. • If possible, it can be used as an alternate of invasive biopsy. • Discover the differences in temporal patterns between hepatitis B and C. • Evaluate whether the interferon therapy is effective or not. • High-cost therapy; but it is not effective for all patients. • Can we forecast its effectiveness in advance using the results of laboratory examinations ? • Validate the following hypothesis regarding GOT and GPT. • (GOT or GPT) x (time) = (stage of liver fibrosis)