Wiki source code of 04. Technické informace

Version 5.1 by Branislav ŠIŠKA on 2023/03/21 11:19

Show last authors
1 = {{id name="04.Technickéinformace-1.Hardware"/}}1. Hardware =
2
3 The Clindata software runs on computer cluster located on Institute of Molecular and Translational Medicine (IMTM), Faculty of Medicine and Dentistry, Palacky University in Olomouc. The facility is secured and under global surveillance.
4
5 === {{id name="04.Technickéinformace-Descriptionofhardware"/}}Description of hardware ===
6
7 **Servers**
8
9 HPE DL385 Gen10 CTO Mod-X 8SFF Svr
10
11 8x HPE 16GB 2Rx8 PC4-2933Y-R
12
13 2x HPE DL385 Gen10 AMD EPYC 7302
14
15 2x HPE 240GB SATA RI SFF SC DS SSD
16
17 **Data Storages**
18
19 HP 3PAR data storage 700TB.
20
21 HP EML tape library
22
23 Object storage
24
25 **Firewall**
26
27 HP F1000-S-EI VPN Firewall
28
29 **~ **
30
31 = {{id name="04.Technickéinformace-2.Software"/}}2. Software =
32
33 === {{id name="04.Technickéinformace-Thesoftwarerequirements"/}}The software requirements ===
34
35 The only requirement for using the (% style="color: rgb(23,43,77);text-decoration: none;" %)Clindata(%%) software is Internet browser which supports HTML5 standard. The list of supported browsers:
36
37 * Chrome: (Current - 1) and Current
38 * Edge: (Current - 1) and Current
39 * Firefox: (Current - 1) and Current
40 * Internet Explorer: 11+
41 * Safari: (Current - 1) and Current
42 * Opera: Current
43
44 (Current means the last available version of given browser)
45
46 === {{id name="04.Technickéinformace-Programminglanguage"/}}Programming language ===
47
48 The main programming language used for development of the (% style="color: rgb(23,43,77);text-decoration: none;" %)Clindata(%%) application is Java 8. Other technologies used for development are:
49
50 * Spring Framework v5.
51 * HTML, CSS
52 * JavaScript
53 * jQuery
54 * jQuery UI
55 * Bootstrap
56 * MathJS
57 * Datatables
58 * SQL
59 * Oracle database
60
61 === {{id name="04.Technickéinformace-Operationsystem"/}}Operation system ===
62
63 Operation system installed on production servers is **RedHat Enterprise Linux 7.4.**
64
65 === {{id name="04.Technickéinformace-Proxyserver"/}}Proxy server ===
66
67 The **Apache HTTP Server** is used as gateway from outside world to internal application running in the production server.
68
69 === {{id name="04.Technickéinformace-Applicationserver"/}}Application server ===
70
71 The **ClinData** application runs on **Apache Tomcat,** which is an open-source Java Servlet Container developed by the Apache Software Foundation
72
73 **~ **
74
75 = {{id name="04.Technickéinformace-3.Database"/}}3. Database =
76
77 === {{id name="04.Technickéinformace-TheClindataDatabase"/}}The (% style="color: rgb(23,43,77);text-decoration: none;" %)Clindata(%%) Database ===
78
79 The database used for storing data from the (% style="color: rgb(23,43,77);text-decoration: none;" %)**Clindata**(%%)** **software is **Oracle Database** (commonly referred to as **Oracle RDBMS**) which is produced by Oracle Corporation. Version of database is 12.1. Standard edition.
80
81 The Oracle database runs on separated Linux based server which si firewalled from external network (Internet) by hardware firewall. This database server is not accessible from outside of organization but only from enlisted inner servers (application and backup servers).
82
83 **~ **
84
85 = {{id name="04.Technickéinformace-4.Backup"/}}4. Backup =
86
87 There are more levels of data archiving to ensure data safety and quick database recovery. Data are archived on **database level** and **operation system level**
88
89 1. Database level backups
90 1*. **RMAN** utility is integral part of the Oracle database. It creates binary copy of whole database and stores it to filesystem. The RMAN utility is run **every week**. The files are stored internally on database server and are copied to two independent backup sites.
91 1*. **EXPDP/IMPDP** is data pump exporting data into text base backups. The EXPDP utility is run **every 4 hours**. The backup target is the same as with RMAN. It is stored to two independent backup sites.
92 1*. **Redo Logs** are archived **every day** to filesystem.
93 1. Operation system backups
94
95 * **IBM Tivoli Storage Manager** (TSM Admin) is enterprise solution from IBM for backups and recovery of physical or virtual servers. The backup created by TSM Admin includes redo logs, RMAN and EXPDP exports. It runs **every day** and the backup data is stored to disk array.
96
97 RMAN configuration file
98
99 CONFIGURE RETENTION POLICY TO REDUNDANCY 1; # default
100 CONFIGURE BACKUP OPTIMIZATION OFF; # default
101 CONFIGURE DEFAULT DEVICE TYPE TO DISK; # default
102 CONFIGURE CONTROLFILE AUTOBACKUP ON;
103 CONFIGURE CONTROLFILE AUTOBACKUP FORMAT FOR DEVICE TYPE DISK TO '%F'; # default
104 CONFIGURE DEVICE TYPE DISK PARALLELISM 1 BACKUP TYPE TO BACKUPSET; # default
105 CONFIGURE DATAFILE BACKUP COPIES FOR DEVICE TYPE DISK TO 1; # default
106 CONFIGURE ARCHIVELOG BACKUP COPIES FOR DEVICE TYPE DISK TO 1; # default
107 CONFIGURE MAXSETSIZE TO UNLIMITED; # default
108 CONFIGURE ENCRYPTION ALGORITHM 'AES128'; # default
109 CONFIGURE COMPRESSION ALGORITHM 'BASIC' AS OF RELEASE 'DEFAULT' OPTIMIZE FOR LOAD TRUE ; # default
110 CONFIGURE RMAN OUTPUT TO KEEP FOR 7 DAYS; # default
111 CONFIGURE ARCHIVELOG DELETION POLICY TO NONE; # default
112 CONFIGURE SNAPSHOT CONTROLFILE NAME TO '/../../oracle/12c/dbs/snapcf_imtm.f'; # default
113
114 EXPDP configuration file
115
116 DIRECTORY=dtpump
117 DUMPFILE=registry.dmp
118 LOGFILE=registry.log
119 CONTENT=ALL
120 COMPRESSION=NONE
121 JOB_NAME=registry_migration
122 SCHEMAS=registry,registry_aud
123
124 = {{id name="04.Technickéinformace-05.Secureconnection"/}}05. Secure connection =
125
126 === {{id name="04.Technickéinformace-Security"/}}Security ===
127
128 As the (% style="color: rgb(23,43,77);text-decoration: none;" %)**Clindata**(%%)** **application is **web-based** application there is need to **secure communication** between **server** and **client's computer**. It is done by using **HTTPS** communication protocol which is encrypted using Transport Layer Security **(TLS).** This protocol is widely used for all secure transactions on the Internet (payment, emails etc.) and is considered as safe and unbreakable. It protects against man in-the middle attacks. Communication without the security layer (HTTP) is can be interfered by attackers, they can listen to it or change it.
129
130 === {{id name="04.Technickéinformace-Securityredirection"/}}Security redirection ===
131
132 All user requests coming via unsecured **HTTP** protocol are automatically **redirected** to secure **HTTPS** protocol. All communication between client and server is secured and there is no way how to connect to the (% style="color: rgb(23,43,77);text-decoration: none;" %)Clindata(%%) software via unsecured connection.
133
134 === {{id name="04.Technickéinformace-Certificate"/}}Certificate ===
135
136 The secured communication requires a certificate stored on the web server. The certificate must be signed by **trusted certificate authority**. The (% style="color: rgb(23,43,77);text-decoration: none;" %)Clindata(%%) server uses the certificate digitally signed by **TERENA** authority.
137
138 **~ **
139
140 = {{id name="04.Technickéinformace-06.Authenticationandauthorization"/}}06. Authentication and authorization =
141
142 === {{id name="04.Technickéinformace-Usersadministration"/}}Users administration ===
143
144 All user using the (% style="color: rgb(23,43,77);text-decoration: none;" %)Clindata(%%) application must be registered before they can log in. There is no possibility to get unauthorized access to the server even for some demonstration purposes. There is a specialized application for user management - IMTM Admin tool.
145
146 The **Admin tool** is responsible for:
147
148 * Management of **institutions, companies, hospitals** and their departments. There can be unlimited number of organization levels, for example a university can have such structure university-faculty-department-laboratory. Each organization level can obtain different set of privileges and roles.
149 * Management of **users**. Every user is identified by email address as login and password. Users are assigned to their organizations. Users can work on more projects with different roles. It is allowed by** user profiles**. Number of profiles for a user is not limited. Each profile can have different set of **privileges and roles**.
150 * Management **roles and profiles**.
151
152 The Admin database with user's data is stored in the Oracle database as separated schema. Access to this schema is restricted only for admin users. The server with the Oracle database is firewalled out of public network and not accessible from Internet.
153
154 An account for new user can be created **only by administrator**. There is no way that user could create his account on its own.
155
156 These steps must be followed to **create new account**:
157
158 * New user asks a project owner to create new account
159 * The project owner asks administrator to create new account with specified privileges and roles
160 * The administrator creates new account and sets required privileges and roles
161 * The project owner checks account setting and approve it.
162 * New user receives his credentials and can log in.
163
164 === {{id name="04.Technickéinformace-Centralauthenticationservice(CAS)"/}}Central authentication service (CAS) ===
165
166 The (% style="color: rgb(23,43,77);text-decoration: none;" %)Clindata(%%) application must be connected with data from the IMTM Admin to control accounts, roles and privileges. It is done by integrating of the CAS technology into the ClinData software. The CAS technology consists of CAS Server and CAS Client.
167
168 The CAS server is responsible for authenticating users and granting accesses to applications. The CAS clients protect the CAS applications and retrieve the identity of the granted users from the CAS server.
169
170 = {{id name="04.Technickéinformace-07.PrivilegesandRoles"/}}07. Privileges and Roles =
171
172 === {{id name="04.Technickéinformace-Accessrestrictions"/}}**Access restrictions** ===
173
174 A user access can be restricted in two different areas:
175
176 * restriction in **access to** ClinData **functionality**
177 * restriction in **access to data** stored in the ClinData software
178
179 All restriction is set in the **IMTM Admin tool**.
180
181 === {{id name="04.Technickéinformace-Functionalityrestrictions"/}}Functionality restrictions ===
182
183 **Privileges**
184
185 Access privileges determine which (% style="color: rgb(23,43,77);text-decoration: none;" %)Clindata(%%) objects a user can browse or edit. Each functionality in the (% style="color: rgb(23,43,77);text-decoration: none;" %)Clindata(%%) software is reflected in corresponding privilege so the access to everything is controlled. Any user or group of users can have access to any privilege granted or restricted.
186
187 The picture shows schema of privileges in the (% style="color: rgb(23,43,77);text-decoration: none;" %)Clindata(%%) software.
188
189 **Roles**
190
191 Roles are virtual entities which serve as container for more privileges.
192
193 There are predefined roles and users, or groups of users can be assigned to them. The most frequently used roles are:
194
195 * ClinData system admin - full access to all functions in ClinData, no restrictions, creating new project
196 * ClinData project admin - full access to all function in selected project including study designer
197 * ClinData project data manager - access to all functions needed to insert new/update patient’s data.
198 * ClinData project data monitor - access to all functions needed for study monitoring, validation and finishing CRFs.
199 * ClinData project data browser - read only access to selected data.
200
201 \\
202
203 === {{id name="04.Technickéinformace-Datarestrictions"/}}Data restrictions ===
204
205 Default setting for accessing of data in the (% style="color: rgb(23,43,77);text-decoration: none;" %)Clindata(%%) software is maximally restricted. A user can see only data he inserted himself. By default, he doesn't see any data inserted by any other user. Access to any other data must be explicitly permitted.
206
207 These options can be set:
208
209 * user can see only his data
210 * user can see data inserted by other user or group of users
211 * user can see data linked with an organization
212 * user can see all data in a study
213
214 === {{id name="04.Technickéinformace-Personaldata"/}}Personal data ===
215
216 There can be studies or registers which contain personal data. Access to this data can be restricted by special privilege.
217
218 These options can be set:
219
220 * user can see personal data
221 * user can't see personal data
222
223 \\
224
225 = {{id name="04.Technickéinformace-8.Logging"/}}8. Logging =
226
227 The ClinData software **records everything** happening in the system. Admin user can browse these records in user friendly way and analyze potential problems, watch user activities etc.
228
229 There are three different types of logging mechanisms:
230
231 * **Software logging** is done on programming language level and is very detailed. The log files contain data about internal state of the whole system in time of logging event. This approach is designed for detailed analyses of problems which happened in past.
232 * **Access logging** is designed for controlling of user’s activities. The access record contains data about who did an action and when. It logs all actions done on all objects in the system. Object can be study, patient, CRF form, file. These actions are logged:
233 ** create
234 ** open
235 ** change
236 ** add
237 ** remove
238 ** delete
239 ** export
240 * **Auditing** is focused to changes done in CRF forms. It records complete history of what was changed by users. One record contains data about:
241 ** when the change was done
242 ** who changed the data
243 ** what was changed
244 ** what is the new value
245
246 The important information is that the ClinData software** doesn't delete any record**. Every record in the database has system **flag ACTIVE**. Deleting of the row just sets this **ACTIVE** flag to **false**. The inactive rows are not displayed in the ClinData software but are still stored in the database.
247
248 **~ **
249
250 = {{id name="04.Technickéinformace-9.Softwaredevelopment"/}}9. Software development =
251
252 === {{id name="04.Technickéinformace-Issuetracking"/}}Issue tracking ===
253
254 Any problem found in the (% style="color: rgb(23,43,77);text-decoration: none;" %)Clindata(%%) software is documented and created as a **new issue in JIRA software**. JIRA software is developed by Atlassian and is an issue tracking tool. The new issue is analyzed, and priority is assigned.  The list of issues is sorted by priorities and processed by developers. When a serious problem is fixed then it is published in new version of the (% style="color: rgb(23,43,77);text-decoration: none;" %)Clindata(%%) software. The issue is also closed as done in JIRA.
255
256 === {{id name="04.Technickéinformace-Changesmanagement"/}}Changes management ===
257
258 All requests for changes planned in the (% style="color: rgb(23,43,77);text-decoration: none;" %)Clindata(%%) software are stored in JIRA. When a new request is coming then it is analyzed, time estimation is done, and priority assigned. The list of issues is sorted by priorities and processed by developers.
259
260 === {{id name="04.Technickéinformace-Versioning"/}}Versioning ===
261
262 The source code of the (% style="color: rgb(23,43,77);text-decoration: none;" %)Clindata(%%) software is stored in** GIT repository** which allows tracking of changes in files. There is possibility to browse history of any source code file in the repository. Every change is also documented so it is easy to understand the development cycle.
263
264 === {{id name="04.Technickéinformace-Codereview"/}}Code review ===
265
266 Any change done in source code of the (% style="color: rgb(23,43,77);text-decoration: none;" %)Clindata(%%) software must be **reviewed** by another developer. This process is called **code review**. This process minimizes number of bugs in source code because everything is double checked. **Bitbucket software** (developed by Atlassian) is used for code reviews. It prevents developers from using not proven code in public versions of the (% style="color: rgb(23,43,77);text-decoration: none;" %)Clindata(%%) software.
267
268 **~ **
269
270 = {{id name="04.Technickéinformace-10.Qualityassurance"/}}10. Quality assurance =
271
272 === {{id name="04.Technickéinformace-Testingenvironment"/}}Testing environment ===
273
274 All new versions of the (% style="color: rgb(23,43,77);text-decoration: none;" %)Clindata(%%) software must be tested and proven as functional and correct before they are published. There is a special environment which is used form testing of the new version before it is published. The testing environment must be similar to production environment to avoid configuration issues.
275
276 === {{id name="04.Technickéinformace-Unittesting"/}}Unit testing ===
277
278 Unit testing is a software testing method by which individual units of source code are tested to determine whether they are fit for use. There are actually more than one thousand-unit tests in the source code of the (% style="color: rgb(23,43,77);text-decoration: none;" %)Clindata(%%) software. All critical parts of the source code are covered by unit test close to 100%. Overall source code is covered by unit test by more than 85%. Any problem in unit testing is blocker for publishing of the version of the software.
279
280 === {{id name="04.Technickéinformace-Applicationtesting"/}}Application testing ===
281
282 The whole application is tested by application exploratory testing before it is published. The application testing is done in testing environment. Any problem in application testing is blocker for publishing of the version.
283
284 === {{id name="04.Technickéinformace-Publishing"/}}Publishing ===
285
286 Publishing process means that a new version of the (% style="color: rgb(23,43,77);text-decoration: none;" %)Clindata(%%) software is being released and made accessible for users. The Bamboo software (developed by Atlassian) is used for building and publishing new versions. Unit testing is also involved in publishing of the new version. In case of any problem in any unit test the whole publishing, process is interrupted, and an notification email is sent to responsible persons.