To explain the concept of polarization of a charge distribution, it is noted that an electric field E is a vector, which by definition "pushes" a positive charge in the direction of the vector and "pulls" a negative electric charge in opposite direction (against the direction of E). Because of this "push-pull" effect the field will distort the charge-distribution ρ, with a build-up of positive charge on that side of ρ to which E is pointing and a build-up of negative charge on the other side of ρ. One calls this distortion the polarization of the charge-distribution. Since it is implicitly assumed that ρ is stable, there are internal forces that keep the charges together. These internal forces resist the polarization and determine the magnitude of the polarizability.
The concept of polarizability is very important in atomic and molecular physics. In atoms and molecules the electronic charge-distribution is stable, as follows from quantum mechanical laws, and an external electric field polarizes the electronic charge cloud. The amount of shifting of charge can be quantitatively expressed in terms of an induced dipole moment.
Am electric dipole of a continuous charge-distribution is defined as
If there is no external field we call the dipole permanent, written as pperm. A permanent dipole moment may or may not be equal to zero. For highly symmetric charge-distributions (for instance those with an inversion center), the permanent moment is zero.
Under influence of an electric field the charge-distribution will distort and the dipole will change,
where pind is the induced dipole, i.e., the change in dipole due to the polarization of the charge-distribution. Assuming a linear dependence in the field, we define the polarizability by the following expression
This relation can be generalized to higher powers in E (in the general case one uses a Taylor series), the polarizabilities arising as factors of E2, and E3 are called hyperpolarizabilities and hyper-hyperpolarizabilities, respectively.
The relation above is valid when the vector p is parallel to the vector E, i.e., α is a single real number, a scalar. It can happen that the two vectors (cause and effect) are non-parallel, in that case the defining relation takes the form
By writing these two vectors in component form we implicitly assumed the presence of a Cartesian coordinate system. The polarizability α is expressed with respect to the very same coordinate system by a matrix,
We know that choice of another Cartesian basis (coordinate system) changes the column vectors pind and E, while the physics of the situation is unchanged, neither the electric field, nor the induced dipole changes, only their representation by column vectors changes. Similarly, upon choice of another basis the polarizibility α is represented by another 3×3 matrix. This means that α is a second rank (because there are two indices) Cartesian tensor, the polarizability tensor of the charge-distribution.
From the defining equation follows that p has the dimension charge times distance, which in SI units is C m (coulomb times meter). In Gaussian units this is statC cm (statcoulomb times centimeter). An electric field has dimension voltage divided by distance, so that in SI units E has dimension V/m and in Gaussian units statV/cm. Hence the dimension of α is
|SI:||C m2 V−1|
|Gaussian:||statC cm2 statV−1 = cm3,|
where we used that in Gaussian units the dimension of V is equal to statC/cm (because of Coulomb's law). In Gaussian units the polarizability has dimension volume, and accordingly polarizability is often considered as a measure for the size of the charge-distribution (usually an atom or a molecule).
The conversion between the two units is:
Sometimes one defines the polarizability in SI units by the equation
This definition has the advantage that α'SI has dimension volume (m3). Clearly
where the power of ten is due to converting from m to cm. Sometimes one also encounters the definition
which gives a polarizability α" with dimension volume and a factor 4π larger than α′.
The energy of a dipole in an infinitesimal field is given by
where the dot indicates a dot product between the vectors. Integration to finite E gives
The second term becomes for a non-isotropic polarizibility in three different, but fully equivalent, notations,
Quantum mechanical expression
Classically, electric charge distributions, such as atoms and molecules, were known to exist, but the classical Maxwell theory could not explain their stability. The empirically known polarizability was likewise unexplainable. This changed after the advent of quantum mechanics. By means of the quantum mechanical technique of perturbation theory one can derive an expression for the induction energy Uind. One introduces a perturbation operator for a system of N particles:
where qk is the charge of the kth particle and rk its position vector (expressed with respect to some Cartesian coordinate system). Clearly, the dipole operator is defined by
In perturbation theory one assumes that the unperturbed (without external field) Schrödinger equations are solved
That is, we assume that all states and corresponding energies are known. Further it is assumed that the states constitute an orthonormal basis for the vector space they belong to. The second-order perturbed energy is
Comparing the second-order energy U(2) with the induction energy Uind gives a quantum mechanical expression for the polarizability tensor:
When a charge-distribution is hit by a monochromatic electromagnetic wave with electric component Ecosωt the polarizibility becomes a function of the angular frequency
where the dipole operator μ is defined above. Time-dependent perturbation theory leads to the following expression,
The quantity |α(ω)|2 is proportional to the cross section for elastic light scattering (Rayleigh scattering), and with a small modification it also gives the cross section for inelastic light scattering (Raman scattering).
The index of refraction n of a charge-distribution is related by the Lorentz-Lorenz relation to its frequency-dependent polarizability α(ω) and hence it follows that n is a function of ω. This leads to the phenomenon of dispersion of light (occurrence of rainbows).
The function α(iω) of imaginary frequency gives rise to one of the components of intermolecular forces, namely dispersion (London) forces.