Introduction
Facial nerve paralysis (FNP) appears as a weakness or loss of facial muscle expression due to damage of the facial nerve. FNP with incomplete recovery significantly influences patients’ quality of life. Facial asymmetry may induce a disturbance of social relationships due to a cosmetic problem and inappropriate eye care or perioral muscular recovery may present functional disturbances. FNP patients require support from many medical specialists, such as neurologists, otolaryngologists, ophthalmologists, plastic surgeons, psychiatrists, and physiotherapists, to appropriately manage FNP [
1]. Although patients with acute peripheral FNP (such as Bell’s palsy) achieve a high rate of complete remission, some fail to achieve recovery and suffer from sequelae after FNP. Early diagnosis and proper grading are crucial in determining the prognosis of FNP and in optimally managing the disease.
Various grading systems have been introduced to assess degrees of FNP. The House-Brackmann grading system and Fisch scaling are examples of the most widely used tools. However, an objective, quantitative, and integrated FNP assessment tool has not been established to date. The complexity of assessment, lack of objectivity and reproducibility, test-retest reliability, and inter-observer differences were suggested as problems of these grading systems. Studies have explicitly attempted to improve the reproducibility, result consistency, and ease of performance of FNP assessment tools [
2,
3]. The objective and consistent assessment of FNP requires minimizing inter-observer variation.
The present study compared the results of a PC-based facial asymmetry assessment program (PC-FAAP) developed to objectively calculate and standardize FNP grading with subjective regional scoring by raters in three facial expression components (smiling, forced closing eyes, and raising eyebrows) for patients with acute unilateral peripheral FNP.
Subjects and Methods
This study was approved by the Institutional Review Board (IRB) of the Seoul National University Boramae Medical Center (IRB No. 16-2016-83). Informed consent was obtained from all participants.
Facial photographs
The facial photographs of the participants were acquired using a camera with a 1334×750 resolution. The frontal photographs of the participants standing at a consistent distance were taken using four facial movements as FNP static views: “At rest,” “Large smile showing teeth,” “Closure of eyes tightly,” and “Elevation of eyebrows” [
4-
6].
PC-FAAP
Facial photographs (resting, mouth extension, forced eye closing, and eyebrow lifting) of 8 healthy people and 16 FNP patients were used. Participants were divided into 3 groups in accordance with the House-Brackmann grading (H-B): group I (normal)=H-B grade I-II, group II (mild to moderate FNP)=H-B grade III-IV, and group III (severe FNP)=H-B grade V-VI. The facial asymmetry was evaluated using PC-FAAP, which we previously developed. This program determined primary facial landmarks and tracked them with an incremental parallel cascade of the linear regression method proposed by Asthana, et al. [
7]. The localization algorithms of facial landmarks, data acquisition, and analysis methods were determined as described in our previous study [
8]. We defined horizontal and vertical lines to assess facial asymmetry. The horizontal line was defined as an extended line connecting each medial canthus. The vertical line was defined as a line perpendicular to the horizontal line at a mid-point between both medial canthus points.
The mouth asymmetry ratio (MAR) was calculated by comparing both sides of the tracking pointed lip area centered around the vertical midline (
Fig. 1, upper). The eyebrow asymmetry ratio (EAR) was measured using the vertical distance from the midpoint of the medial and lateral canthal line to the upper margin of the eyebrow when lifting the eyebrows (
Fig. 1, middle). The complete eye closure asymmetry ratio (CAR) was estimated using the area between a point-tracked low eyeline and the upper margin of the eyebrow with an identical and parallel distance from each medial canthal point (
Fig. 1, lower). A denominator of each ratio in a normal healthy group was determined as the larger of the values on both sides. Our previous study revealed that the perception of the most unnatural facial area in FNP patients was the mouth, followed by the eyes and the eyebrows [
9]. Accordingly, MAR, CAR, and EAR were weighted with a ratio of 5:3:2 to determine the FNP grading score (FGS) while considering the results of perception of facial asymmetry according to the facial area found in our previous study [
9].
We compared differentiation in facial asymmetry using this program for three groups: normal, mild to moderate, and severe FNP. To evaluate the consistency of the results, the three scores of each subject consecutively measured by PC-FAAP were compared to the visual analogue scale (VAS) scores measured by three otologists who were not aware of this study. They were surveyed to rate the grade of FNP compared to the unaffected side on a 100-scale score for mouth extension, forced eye closing, and eyebrow lifting. VAS 100-score indicated that there is no asymmetry. Additionally, 100-scaled scores for overall FNP grade compared to the unaffected side were also presented.
Statistical analysis
The MAR, EAR, CAR, and FGS were compared between groups using the Mann-Whitney U test. The correlations between resting and expressed MAR or EAR were investigated using Pearson correlation analyses. A variation analysis was performed using one-way ANOVA. The SPSS 21.0 statistical package was used for the analysis (IBM Corp., Armonk, NY, USA). p values ≤0.05 were considered statistically significant.
Discussion
Botman and Jongkees [
10] introduced the first FNP scaling system with 5 grades in 1955. In 1983, House-Brackmann [
11] developed a new scale to identify the status and recovery of facial nerve damage after removing vestibular schwannomas, which has been a widely used scale of FNP to date. However, this well-made scaling system is being used in various diseases showing FNP such as Bell’s palsy, Ramsay-Hunt syndrome, and FNP by infection, trauma, or tumor. There are also at least 10 or more scaling systems [
11,
12]. Most scaling systems have some drawbacks. Among others, the complexity of assessment methods and non-objectivity of rater measurements appear to be the most severe weakness for existing FGS systems. A lack of objectivity in grading FNP may complicate comparing functional changes and sharing accurate information. Measuring and scaling FNP by an automatized program, not by raters, can reduce the individual difference that raters make.
We propose an FGS for patients with FNP using a PC-based program in three facial expression components: large smile showing teeth, closure of eyes tightly, and elevation of the eyebrows, without constraints from the recording environment. Although the survey study reported “Closure of eyes gently” instead of “Closure of eyes tightly” as a view described in common grading scales, we chose to use “Closure of eyes tightly,” which can better reflect the function of the orbicularis oculi [
4,
6]. Although the present study did not include various variables such as static, dynamic, synkinesis, and facial area, this preliminary scaling could contribute to early diagnosis and treatment in patients with FNP when considering the current feasibility of wireless communication. In addition, this integrated assessment of facial asymmetry at facial expression points may be helpful in evaluating the severity of FNP and improving existing FNP grading systems.
The PC-based program used in the present study was developed to assess FNP patients using clear and straightforward methods and to avoid interrater unreliability. It is relatively easy to design and modify modules using this program. Additionally, because facial photographs of patients with FNP are entered into the program and analyzed, we could plan to establish remote medical examination and treatment in further research. A PC-based program can be used by non-practitioners to enable early FNP detection and diagnosis. Our study demonstrated that severe FNP patients were the best candidates for PC-based analysis. Although participants with normal facial expressions also showed more consistent results with PC-based analysis, we think that the system is not useful for treating FNP in this context; however, it may be meaningful if used in aesthetic procedures.
Although there was a small number of participants in each group, two significant results should be emphasized. First, the PC-based assessment was significantly consistent in measuring eyebrow elevation. As shown in
Fig. 2, the MAR was also more consistent in PC-FAAP; however, we think that statistical significance was not attained due to the small number of patients. Notably, the maximal eye closure assessment was approximately the same between PC-FAAP and the subjective analysis in terms of consistency. Consistency in the subjective analysis of facial expression has been reported to be best for eye closure [
13]. Therefore, we could conclude that PC-based assessment is more advantageous in assessing eyebrow or mouth movement and that subjective assessment can be as consistent as a PC-based assessment. However, there were no significant differences in the consistency of the subgroup analysis. Considering the small number of patients included, a larger cohort study would reveal significant differences between groups II and III.
Second, the PC-based and subjective assessments were not significantly different in mild to moderate FNP patients with regard to consistency, while a PC-based assessment was superior in severe FNP patients (
Fig. 3 and
4). This was contrary to our expectations when we started this study. We hypothesized that PC-FAAP might help consistently identify and differentiate mild FNP; however, our study showed that PC-FAAP had comparable variation in assessing mild to moderate FNP. Moreover, the subjective assessment was more consistent in CAR, and the superiority of the subjective assessment found in evaluating eye closure can be a possible reason. Although the exact reason for this finding was not be discussed further, PC-FAAP is more appropriate and advantageous in measuring facial expressions in severe FNP cases, and further studies are needed to improve the consistency of PC-FAAP in mild to moderate FNP patients. Statistical significance was not attained in severe FNP (group III); however, MAR, CAR, and EAR showed better consistency with PC-FAAP. We think that a study with a larger cohort may show statistical significance.
Elderly subjects with severe FNP may have a substantial confounding factor when evaluating facial nerve function by pixel subtraction using a PC program [
14]. The authors suggested that their skin laxity and facial hypomobility can induce errors in this system. In addition, the weighting of facial grading variables may be arbitrary, although the ratio used in our study is based on our previous study [
9]. People reported that the most unnatural facial areas (unnatural facial expression) in FNP patients were the lip (smile) followed by the eye (eye closure), cheek (whistling), and eyebrow (frowning). However, this is based on subjective assessment, and weighting variables can be changed by racial, ethnic, or cultural differences. Further studies regarding weighting on each area of the paralyzed face will indicate more representative scores. Facial grading variables, such as MAR and EAR at rest status, were not significantly different between groups. This may mean that PC-FAAP is not sensitive to FNP at rest, which should be refined in the future.
The present study has some limitations. 1) Facial asymmetry evaluation in FNP patients was conducted in a single center. Further study in multiple centers may redeem the variations generated by the calibration. 2) This study was limited to the static measurement of faces with FNP. Further developing PC-FAAP using facial videos may improve the accuracy of FNP grading variables. 3) Because PC-FAAP is a tentative program, a supplemental point tracking technique is needed for the accurate detection of facial asymmetry. 4) PC-FAAP did not analyze other considerations, including synkinesis, in this study. Further studies should apply these to obtain more detailed results. 5) Bilateral FNP cannot be evaluated with this program, as is the case with most other assessment tools. This may be another problem because the present grading systems depend on comparisons with the non-affected side [
15].
The following are required to commoditize the FNP assessment tool: 1) Simplification of the assessing method, 2) Qualification and objectification of scaling, and 3) Establishment of consensus for the sharing of practitioners. To fulfill this, further studies that measure and analyze various conditions of the FNP should be performed. Additionally, the authorized organization should conduct verification research to assess programs that show effectiveness.
In conclusion, the PC-based assessment of FNP can avoid objective scoring data bias by raters. This assessment may be more advantageous than eyebrow asymmetry and integrated FGS, and the grade of severely affected FNP can be assessed more consistently by PC-FAAP. An objective grading system using PC may be more appropriate in severe FNP patients than mild to moderate FNP patients. Additional studies with more patients may further reinforce the functions of PC-FAAP.