Next: Fermi operator expansion Up: Appendix C Previous: Appendix C

Kernel polynomial method

 




Chebyshev Polynomials

Chebyshev polynomials are defined as follows

equation578

and like all orthogonal polynomials they obey a three terms recursion relation :

  eqnarray582

Any function f(x) with x in [-1,1] can be approximated by a truncated series of Chebyshev polynomials :

  equation595

or alternatively

  equation602

where the expansion coefficients are given by

   eqnarray611

The KPM uses (15) and (17) whereas the the FOE relies on (14) and (16).



Density of states

Let us consider the the density of states of our system, defined as :

equation639

where tex2html_wrap_inline2078 are the eigenvalues of tex2html_wrap_inline2080 . The first step is to define a rescaled Hamiltonian tex2html_wrap_inline2082 whose eigenvalues tex2html_wrap_inline2084 lie in the interval tex2html_wrap_inline2086 :

  equation647

where

equation653

Equation (19) is the operator equivalent of the change of variables

  equation661

which we will use thru out this paper.

This requires the knowledge of the find the lowest and highest eigenvalues of tex2html_wrap_inline2080 ( tex2html_wrap_inline2090 and tex2html_wrap_inline2092 respectively). This can be done very efficiently (actually in an O(N) way) using the Lanczos method.

One can define a scaled DOS which can be represented by a Chebyshev polynomial expansion

  eqnarray667

where the tex2html_wrap_inline2096 are called the moments and are defined by (17) (in which f(x) had been substituted with D(x)). In practice we only consider a finite number tex2html_wrap_inline1902 of moments in the series (23). Such an abrupt truncation of (23) would result in unwanted Gibbs oscillations, in analogy to Fourier series.

The important point made by Silver and co-workers[3] is to consider instead smooth truncations of the form :

  equation684

where the tex2html_wrap_inline2104 are called Gibbs damping factors. Those factors can be chosen in such a way as to impose certain properties to tex2html_wrap_inline2106 , like for instance smoothness, positiveness, etc. , depending on the application. Many choices of Gibbs damping factors are possible, for a more detail discussion of possible choices we direct the reader to Ref. [3]. In the context of this work we have restricted ourselves to the so-called Jackson damping factors which guarantee that tex2html_wrap_inline2106 is non-negative [3].

To understand the nature of the approximation introduced by the KPM one examine the relationship between the approximate DOS in (24) and the exact DOS. This can more easily be done after a change of variable to work in tex2html_wrap_inline2110 space where tex2html_wrap_inline2112 and the mth Chebyshev polynomial is simply tex2html_wrap_inline2116 . In tex2html_wrap_inline2110 space the approximated DOS is related to the exact DOS by a convolution :

  equation700

where tex2html_wrap_inline2120 is the kernel polynomial, which is an approximation to a Dirac function (normalized on tex2html_wrap_inline2122 )

equation708

which has a width proportional to tex2html_wrap_inline2124 . In Figure 2 we have plotted tex2html_wrap_inline1892 with and without Gibbs damping factors.

  figure717
Figure 2:  The kernel polynomial tex2html_wrap_inline1892 for tex2html_wrap_inline1894 . The dotted line represents the raw undamped polynomial, which exhibits strong Gibbs oscillations. The solid line is the kernel polynomial using Jackson damping factors which strongly suppresses the oscillations and enforces positivity.

The approximate DOS can therefore be seen as a smearing of the exact DOS with a know resolution function.

It is clear that (25) is very general and can be used to approximate any function f(x), by just replacing D and tex2html_wrap_inline2136 by f and tex2html_wrap_inline2140 , respectively.


Fermi energy, number of electrons and electronic energy

Assuming that we have calculated some tex2html_wrap_inline1902 moments (how this is done will be addressed in the next section) we now show how they can be used to determine the Fermi energy tex2html_wrap_inline2052 (or its rescaled equivalent tex2html_wrap_inline2148 ) and to calculate the electronic energy.

The number of electrons in the system can easily be found by integrating the DOS :

  eqnarray729

where tex2html_wrap_inline2150 is the tex2html_wrap_inline2110 -space equivalent of the Heaviside function and tex2html_wrap_inline2154 . Equation (27) can be seen as the definition of the Fermi energy tex2html_wrap_inline2052 .

The electronic energy of the system is defined as the energy integral over the occupied states of the DOS,

  eqnarray736

If we substitute the smeared DOS ( tex2html_wrap_inline2158 as defined in (25) for tex2html_wrap_inline2160 in (27) we get an approximation for the number of electrons (as a function of tex2html_wrap_inline2162 ) :

  equation746

with tex2html_wrap_inline2164 and tex2html_wrap_inline2166 . In practice this last expression will be used to determine the tex2html_wrap_inline2162 which gives the correct number of electrons : tex2html_wrap_inline2170 . In this respect one should point out the importance of choosing Gibbs damping factors which guarantee that tex2html_wrap_inline2158 is non-negative which in turn defines a unique solution for tex2html_wrap_inline2162 .

Similarly if we replace tex2html_wrap_inline2160 in (28) with tex2html_wrap_inline2158 we get the smeared-DOS (SD) approximation to the electronic energy :

equation760

An alternative expression to calculate the electronic energy is found by taking the kernel polynomial approximation to the Fermi-function in (28) (see equ. 25) :

equation774

This yields the smeared-Fermi (SF) approximation to the electronic energy :

equation779

It can be shown that the substitution of tex2html_wrap_inline2180 for tex2html_wrap_inline2182 in equation (27) yields the same expression for tex2html_wrap_inline2184 as in (29)[4].



Testing and comparison of the KPM and the FOE

  figure797
Figure 3:  Approximate DOS as a function of energy for a) 64 Si atoms calculated with a 100 moments and b) 64 atoms of Cu. The sharp vertical lines show the positions of the exact eigenvalues, with a height proportional to their degeneracy.

To illustrate the method presented in the previous section, we plotted in Figure 3a) and b) the approximate DOS at the tex2html_wrap_inline2186 -point as a function of energy for a 64 Si atoms system calculated with 100 moments and for 64 Cu atoms with 200 moments respectively. For comparison we also show the exact eigenvalues. One can clearly see that the approximated DOS is a smeared version of the exact DOS.

  figure807
Figure 4:  Comparison of the energy from the KPM vs the number of moments, calculated using the SD (circles) and the SF (diamonds) energy expressions to the exact energy (solid line). a) Electronic energy for 64 atoms of Si, b) Electronic energy for 64 atoms of Cu.

The KPM offers two expressions to compute the electronic energy given the moments tex2html_wrap_inline2096 and some Gibbs damping factors tex2html_wrap_inline2104 . We designate these as the smeared-DOS (SD) and the smeared-Fermi (SF) expressions respectively (see section C.3). In Figure 4 we compare the convergence of the two expressions with respect to the number of moments. Two observations can be made. First we see that the SF expression converges much faster than the SD expression. The reasons for this have been discussed in Ref.\ [3]. In light of this we will always use the SF expressions in our calculations. Second we see that we seem to have tex2html_wrap_inline2192 , and it can be shown that this is in general the case.

In Figure 5 we show the error in the total energy as a function of the number of moments for different systems, calculated using the KPM.

  figure822
Figure 5:  Convergence of the error in the electronic energy as a function of the number of moments for different systems. The error is defined as the difference between the exact electronic energy (obtained from the eigenvalues of the Hamiltonian) and the KPM approximation. All the energies where calculated using the SF expression. We see that the convergence for metals is generally slower.

The calculation for Si were done by using the Goodwin, Skinner and Pettifor TB model for Si, and the calculations for Al using the Harrison parameterization, both these models are orthogonal. The results for Cu and Mo were obtained using the non-orthogonal tight-binding model developed by Mehl and Papaconstantopoulos [5]. We see that for most systems the electronic energy is converged to within 10meV/atom (which is the generally accepted accuracy of the TB method) for less than 200 moments. We see that the convergence is slower in metallic systems. Although the convergence of the total energy is slow with respect to the number of moments, it should be stressed that the energy differences converge faster. To illustrate this point we have calculated the vacancy formation energy in bcc Mo. In Figure 6 we show how the total energy and the formation energy converge with respect to the number of moments.

  figure833
Figure 6:  Vacancy in bcc Mo. Convergence of energies as a function of the number of moments. a) Total energy for the perfect crystal, for the crystal with a defect and vacancy formation energy (difference of the two others). The dotted lines show the exact results. b) Energy differences for the same quantities as in a). We see that the error in the formation energy is much smaller.


Calculation of the moments

From a computational point of view, the most time consuming part of the KPM is the calculation of the moments tex2html_wrap_inline2096 . Using equation (17) the moments are defined by

eqnarray844

where we used the fact that the DOS may be written as a trace :

eqnarray856

If we introduce the following notation :

  eqnarray861

where the last equation is a consequence of (13), then one can express the moments as follow :

  eqnarray873

We see that the computationally intensive part of determining the moments is the calculation of the tex2html_wrap_inline2210 using (36), which involves one matrix-matrix-multiplication (MMM) for each value of m. One very useful feature of the KPM is that to calculate tex2html_wrap_inline1902 moments one requires tex2html_wrap_inline2216 MMMs.



Florian Kirchhoff
Tue Jun 9 16:34:36 EDT 1998