ORCA interface

ORCA is a general quantum chemistry program written and developed by Prof. Dr. Frank Neese and coworkers at the MPI für Kohlenforschung in Mülheim, Germany. The program is free for academic use, very easy to install (binary download only), one of the fastest GTO-based QM programs and is full of advanced methods, both DFT-based and WFT-based.

ASH features a highly flexible interface to ORCA and is in fact the earliest interface written, due to our long experience with the program. Energies and gradients are available in the interface so ORCATHeory in ASH can be used for single-point energies, geometry optimizations, numerical frequencies, surface scans, NEB and molecular dynamics within ASH (without relying on any of the algorithms within ORCA). Additionally, ASH can call on ORCA to calculate a TDDFT or deltaSCF gradient which enables excited-state optimizations/MD. Full QM/MM pointcharge-support is available so the interface can be used fully in QM/MM jobs. ASH can call on ORCA to calculate an analytic Hessian.

Since the arrival of the JSON interface in ORCA 6.0 ASH now also supports various ways of conveniently reading WF data from ORCA via the JSON-file (created from a GBW file). This means that one can get easy access to the 1-electron/2-electron integrals, MO coefficients, densities etc. directly within the ASH-ORCA interface.

Other features of the interface:

Convenient ways of specifying convergence to a broken-symmetry state
Specifying of atom-specific basis sets
Flexible options to specify guess orbitals
Automatic grabbing of ORCA errors and warnings
Printing of population analysis in each Opt/MD step.
Definitions of fragments

See ORCA manual . See also ORCA forum ORCA forum .

ORCATheory class:

class ORCATheory:
    def __init__(self, orcadir=None, orcasimpleinput='', printlevel=2, basis_per_element=None, extrabasisatoms=None,
                extrabasis=None, atom_specific_basis_dict=None, ecp_dict=None, TDDFT=False, TDDFTroots=5, FollowRoot=1,
                orcablocks='', extraline='', first_iteration_input=None, brokensym=None, HSmult=None, atomstoflip=None,
                numcores=1, nprocs=None, label="ORCA",
                moreadfile=None, moreadfile_always=False, bind_to_core_option=True, ignore_ORCA_error=False,
                autostart=True, propertyblock=None, save_output_with_label=False, keep_each_run_output=False,
                print_population_analysis=False, filename="orca", check_for_errors=True, check_for_warnings=True,
                fragment_indices=None, xdm=False, xdm_a1=None, xdm_a2=None, xdm_func=None, NMF=False, NMF_sigma=None,
                cpcm_radii=None, ROHF_UHF_swap=False,
                deltaSCF=False, deltaSCF_PMOM=False, deltaSCF_confline=None, deltaSCF_turn_off_automatically=True):

Keyword	Type	Default value	Details
`orcadir`	string	None	Path to ORCA directory.
`orcasimpleinput`	string	''	Definition of the ORCA simple-input line
`orcablocks`	string (multiline)	''	Used for block-input in the ORCA inputfile.
`extraline`	string	''	Additional inputfile-option for ORCA.
`printlevel`	integer	2	How much output printed by the ORCA module.
`extrabasisatoms`	list	None	What atomindices should have a different basis set (gets added to coordinate block)
`extrabasis`	string	None	What the basis set on extrabasisatoms should be
`TDDFT`	Boolean	False	Whether to do TDDFT or not. If part of a Gradient job or Optimization job then the excited state gradient is calculated and used.
`TDDFTroots`	integer	5	How many TDDFT roots to calculate if TDDFT=True
`FollowRoot`	integer	1	What excited state root to calculate gradient for if TDDFT=True.
`brokensym`	Boolean	False	Whether to do a Flipspin ORCA calculation to find a BS solution. Requires HSmult and atomstoflip options.
`HSmult`	integer	None	What high-spin multiplicity to use in a brokensym=True job.
`atomstoflip`	list	None	What atom indices to spin-flip.
`moreadfile`	string	None	Name of file or path to file of a GBWfile to read in to the ORCA calculation
`moreadfile_always`	Boolean	False	Whether moreadfile option is constantly applied for all runs using this ORCATheory object or only for first run. Default: False meaning moreadfile is only used for first run using ORCATHeory object.
`autostart`	Boolean	True	Whether ORCA will automatically try to read orbitals from a GBW file with same basename.
`numcores`	integer	1	Number of cores to use for ORCA
`filename`	string	'orca'	Name of inputfile and outputfile
`label`	string	None	Label for ORCA object. Useful if working with many ORCATheory objects to distinguish them.
`propertyblock`	string	None	String containing ORCA-block input (e.g. %eprnmr) that must come after the coordinates.
`keep_each_run_output`	Boolean	False	Whether to keep copy of each ORCA outputfile from each run-call (e.g. each Opt-step).
`print_population_analysis`	Boolean	False	Whether to print Mulliken population analysis for each step
`print_population_analysis`	Boolean	False	Whether to print Mulliken population analysis for each step
`check_for_errors`	Boolean	True	Whether to check for errors in ORCA output once ORCA calculation is done.
`check_for_warnings`	Boolean	True	Whether to check for warnings in ORCA output once ORCA calculation is done.
`fragment_indices`	list of lists	None	Optional: list of lists of atom indices that specify whether atoms belong to a specific ORCA fragment (e.g. for ORCA multi-level PNO calculations). Example: [[1,2,3],[10,11,12],[13,14,15]]. Will affect the coordinate-block in the ORCA inputfile. For QM/MM: atom indices must be in QM-region.
`cpcm_radii`	list of floats	None	By providing a list of radii (in Å) for each atom in the molecule, the CPCM radii will manually be changed in the ORCA inputfile. Typically used with DRACO-radii Helper-programs interfaces

Finding the ORCA program

ASH can find the ORCA program in a few different ways.

ASH will first check if the orcadir argument has been set which should be a string that points to the directory where the orca program is located, e.g. "orcadir=/path/to/orca_5_0_2". This option takes precedence.
If the orcadir argument has not been provided ASH will next see if orcadir has been provided in the ASH settings (~/ash_user_settings.ini file): See Basic usage
If orcadir has also not been defined at all, ASH will next search the operating systems's PATH environment variable for an executable "orca" and if found, will set the orcadir accordingly and use that ORCA version.

The latter is the most convenient option, but does require you to have already defined your shell environments correctly in the jobscript or shell-startup file. Be careful, however, if you have multiple versions of the program available.

Warning

The ORCA program binaries are nowadays often provided as a small-size shared version (has dynamically linked binaries). This means that for ORCA to run using the shared-library version, both the PATH and LD_LIBRARY_PATH needs to be set in the shell environment (should point to the ORCA directory). ASH can not set the LD_LIBRARY_PATH (must be done in the shell environment beforehand) and thus if LD_LIBRARY_PATH has not been set properly in the shell, ORCA will crash when called by ASH. This means that it is usually best to set the PATH and LD_LIBRARY_PATH to ORCA in your jobscript or login shell-file (.bashrc, .bash_profile etc.) and ASH will then be able to find ORCA like that.

Parallelization

ORCA parallelization is handled by OpenMPI. By specifying the numcores=X as a keyword when creating the ORCATheory object, a %pal numcores X end block will be added to the ORCA inputfile that ASH creates. ORCA then handles its own parallelization, will call the OpenMPI mpirun binary when needed which does requires the correct OpenMPI version to be installed and available in PATH. Make sure the recommended OpenMPI version for the ORCA version you are using is available. This typically requires setting (in the shell or jobscript):

export PATH=/path/to/openmpi/bin:$PATH
export LD_LIBRARY_PATH=/path/to/openmpi/lib:$LD_LIBRARY_PATH

or alternatively loading the appropriate module (if the computer is using modules). Set these variables in the job-script (see Basic usage) that you are using.

Examples

The ORCA interface is very flexible. orcasimpleinput and orcablocks keyword arguments (accepts single or multi-line strings) have to be provided and these keywords define what the ORCA-inputfile looks like. This means that you can completely control what type of electronic structure method should be used by ORCA including choosing aspects such as basis set, convergence and grid settings etc. The geometry block will be added to the inputfile by ASH. Note that ASH handles aspects such as telling ORCA what orbitals to read as well as parallelization.

Warning

Do not put parallelization information (! Pal4 or %pal nprocs 4 end)or job-type keywords such as "! Opt" "!Freq" to the orcasimpleinput and orcablocks variables. Both parallelization and jobtype-functionality must be handled by ASH.

#Create fragment object from XYZ-file
HF_frag=Fragment(xyzfile='hf.xyz', charge=0, mult=1)
#ORCA
input="! BP86 def2-SVP tightscf"
blocks="""
%scf
maxiter 200
end
%basis
newgto F "ma-def2-SVP" end
end
"""

ORCAcalc = ORCATheory(orcasimpleinput=input, orcablocks=blocks, numcores=8)

#Run a single-point energy job
Singlepoint(theory=ORCAcalc, fragment=HF_frag)
#An Energy+Gradient calculation
Singlepoint(theory=ORCAcalc, fragment=HF_frag, Grad=True)

Here a fragment (here called HF_frag with a defined charge and multiplicity) is defined (from an XYZ file) and passed to the Singlepoint function along with an ORCAtheory object (called ORCAcalc). The input, and blocks string variables are defined and passed onto the ORCA object via keyword arguments. By default, the ORCA autostart feature is active, meaning that if an inputfile with name "orca-input.inp" is run, ORCA will try to read orbitals from "orca-input.gbw" file if present. This is utilized automatically during geometry optimizations, numerical frequencies as well as multiple single-point calculations sequentially. It is possible to turn this off by adding "!Noautostart" in the simple-inputline of the orcasimpleinput variable or by setting autostart=False when defining ORCATheory object. It is also possible to have each ORCA-calculation read in orbitals from another source by using the: moreadfile keyword argument option:

ORCAcalc = ORCATheory(orcadir=orcadir, orcasimpleinput=input,
                    orcablocks=blocks, numcores=8, moreadfile="orbitals.gbw")

Note: For parallel-ASH calculations (ASH in parallel, ORCA in serial). The full path to the moreadfile may be required.

The ORCA object is then used by passing it to a function: e.g. Singlepoint, an optimizer, a QM/MM object, NumFreq function etc. When the ORCA object is run (e.g. by the Singlepoint function, an optimizer etc.) it will create an ORCA inputfile that will always be called orca-input.inp. This inputfile will look familiar to any ORCA user as it will contain a "Simpleinput line", Block-input a coordinate block etc. (cordinates in Å). ASH will then tell ORCA to run the inputfile and an outputfile called orca-input.out will be created. Once the ORCA calculation is done the outputfile (or other files) is read for information (usually the energy and gradient) by ASH and ASH will continue. The ORCA inputfile , "orca-input.inp" may be replaced later (e.g. if an optimization job" and ORCA will be run again.

Broken-symmetry DFT example

ORCA is quite a convenient program for finding broken-symmetry SCF solutions and within an ORCA inputfile one can easily tell ORCA to find a broken-symmetry solution within the %scf block (Flipspin or Brokensym options). While this could in principle simply be specified by the user in the orcablocks variable, this would have the drawback of ORCA attempting a broken-symmetry search everytime the program is called, e.g. in every ASH optimization step or an ASH MD run. This is almost never what we want since we simply want to find the broken-symmetry SCF solution once and then reuse those orbitals in a subsequent step (and so on). This is why ASH features a way to control the broken-symmetry search by the brokensym keyword in the ORCATheory object as shown below. In addition to the brokensym keyword we have to specify the high-spin multiplicity and which atom indices to flip as a list(atomstoflip) in the molecule. Do note that atomstoflip should always be a list of atom indices referring to the whole system. If the ORCATheory object is used to make a QMMMTheory object, the atom indices are automatically converted into QM-region indices by ASH.

#Create fragment object from XYZ-file. Here a hypothetical Fe dimer complex
frag=Fragment(xyzfile='fedimer.xyz', charge=0, mult=1)

#ORCA settings
inputline="! BP86 def2-SVP tightscf UKS "
#Here we specify a broken-symmetry solution with a high-spin multiplicity of 11 and flipping atoms no. 0
ORCAcalc = ORCATheory(orcasimpleinput=inputline, brokensym=True, HSmult=11, atomstoflip=[0])

#Run a broken-symmetry DFT geometry optimization
Optimizer(theory=ORCAcalc, fragment=frag)

Running the above job would have the effect of ASH initially writing an ORCA inputfile containing broken-symmetry settings (Flipspin and FinalMS keywords, high-spin multiplicity etc.) but this would only apply to the first step of the geometry optimization. Once the ORCATheory object has been run once with broken-symmetry settings, the broken-symmetry feature is automatically turned off. The next time the ORCATheory object is run (the next geometry optimization step of the above example), ASH creates an ORCA inputfile with regular SCF inputsettings with the spin multiplicity being the low-spin BS multiplicity. Since the broken-symmetry SCF orbitals are available in the GBW file they are automatically loaded.

Warning

Do note that when finding broken-symmetry singlets it is important to use the UKS keyword in the ORCA inputfile when performing a job other than a single-point job (e.g. optimization). This is because to stay on the broken-symmetry surface, ORCA will read in the GBW file from the previous broken-symmetry GBW file and then it is important for an unrestricted SCF to be performed. The default for singlets in ORCA is to run a RKS closed-shell SCF.

ORCA_External_Optimizer: Using ORCA algorithms using ASH Theories

It is possible to use ORCA as an external optimizer or job-driver This means that the ORCA algorithms, e.g. Optimizer and the GOAT conformational sampler can be used with an ASH Theory level as input.

def ORCA_External_Optimizer(fragment=None, theory=None, orcadir=None, charge=None, mult=None,
                            ORCA_jobkeyword="Opt", ORCA_blockinput="", actatoms=None):

ORCA_jobkeyword can be any valid ORCA-job keyword in principle. In practice the following keywords make sense: - 'Opt' (for using the ORCA geometry optimizer), - 'GOAT'

Additional block-options to ORCA can in principle be provided using ORCA_blockinput . If the ORCA optimizer (or related job) should only optimize certain atoms, i.e. there is an active region, then the ORCA optimizer must enforce constraints. These constraints are automatically set up (by ASH when creating the ORCA inputfile) if a list of active-atoms are provided to the actatoms keyword argument.

Basic example:

from ash import *

# H2O Fragment
frag = Fragment(databasefile="h2o.xyz")
# PySCFTheory
pys = PySCFTheory(scf_type="RKS", functional="b3lyp", basis="def2-SVP")
# Calling ORCA_External_Optimizer
ORCA_External_Optimizer(fragment=frag, theory=pys, ORCA_jobkeyword="Opt")

QM/MM optimization with an active region:

from ash import *
# H2O...MeOH fragment defined. Reading XYZ file
H2O_MeOH = Fragment(xyzfile=f"h2o_MeOH.xyz")

# Write PDB-file for OpenMM (used for topology)
H2O_MeOH.write_pdbfile_openmm(filename="h2o_MeOH.pdb", skip_connectivity=True)
pdbfile="h2o_MeOH.pdb"

# Specifying the QM atoms (3-8) by atom indices (MeOH). The other atoms (0,1,2) is the H2O and MM.
# IMPORTANT: atom indices begin at 0.
qmatoms=[3,4,5,6,7,8]
# QM
qm = PySCFTheory(scf_type="RKS", functional="PBE", basis="def2-SVP", densityfit=False)
# MM: OpenMMTheory using XML-file
MMpart = OpenMMTheory(xmlfiles=[f"MeOH_H2O-sigma.xml"], pdbfile=pdbfile, autoconstraints=None, rigidwater=False)
# Creating QM/MM object
QMMMobject = QMMMTheory(fragment=H2O_MeOH, qm_theory=qm, mm_theory=MMpart, qmatoms=qmatoms,
                        embedding='Elstat', qm_charge=0, qm_mult=1)
# ORCA EXTOPT on ASH QM/MM theory with an active region
ORCA_External_Optimizer(ORCA_jobkeyword="Opt", fragment=H2O_MeOH, theory=QMMMobject, actatoms=qmatoms, charge=0, mult=1)

Wrapper around ORCA helper programs

ASH features wrappers around useful ORCA programs such as orca_plot, orca_mapspc and orca_2mkl. This allows you to conveniently use these sub-programs within a Python script and as part of an ASH workflow.

run_orca_plot

# Simple Wrapper around orca_plot for creating Cube-files of MOs or densitities.
def run_orca_plot(filename, option, orcadir=None, gridvalue=40,densityfilename=None, mo_operator=0, mo_number=None):

Filename should be the name of the ORCA-GBW file (e.g. file.gbw, file.loc, file.qro etc.). Option should be either 'density', 'mo', 'cisdensity', 'spindensity', 'cisspindensity'. Gridvalue is by default 40 (same as in orca_plot). The orcadir keyword is optional, ASH will try to find orca_plot in your PATH environment if not present.

For option='mo' you should also provide mo_number (valid integer) and mo_operator (0 or 1 for alpha and beta respectively): e.g. mo_number=17 and mo_operator=1 to plot beta-MO no. 17 For the density-options you should also provide the name of the density file. Example:

#Example on how to plot multiple MO's
for mo_index in [17,20,25,30]:
  run_orca_plot("file.gbw", 'mo', gridvalue=50, mo_operator=mo_index, mo_number=0)

run_orca_mapspc

# Simple Wrapper around orca_mapspc to create a broadened spectrum from a ORCA outputfile (creates .dat and .stk files)
def run_orca_mapspc(filename, option, start=0.0, end=100, unit='eV', broadening=1.0, points=5000, orcadir=None):

make_molden_file_ORCA

#Make a Molden file from ORCA GBW file (uses orca_2mkl)
def make_molden_file_ORCA(GBWfile, orcadir=None):

ORCA fragment guess

It is possible to use the function orca_frag_guess to divide an ASH fragment into two fragments, run an ORCA calculation on each fragment using an ORCATheory level and then combine the orbitals from the two fragments into a single GBW file (uses orca_mergefrag). This could be utilized to make a more accurate guess of the whole system.

#Make an ORCA fragment guess. Returns name of GBW-file created ("orca_frag_guess.gbw")
def orca_frag_guess(fragment=None, theory=None, A_indices=None, B_indices=None, A_charge=None, B_charge=None, A_mult=None, B_mult=None):

ORCA_JSON

Since ORCA 6.0, ORCA-JSON feature has become more powerful, allowing for extracting MOs and all integrals from an ORCA GBW-file. ASH features a few functions for conveniently creating or reading ORCA-JSON files.

#Wrapper around orca_2json to create JSON file from ORCA GBW file
def create_ORCA_json_file(file, orcadir=None, format="json", basis_set=True, mo_coeffs=True, one_el_integrals=True,
                          two_el_integrals=False, two_el_integrals_type="ALL", dipole_integrals=False, full_int_transform=False):

#Read ORCA json file: MO-coefficients, MO-energies, basis set, H,S,T matrices, 2-electron ints, densities etc.
#Returns a dictionary with all information
def read_ORCA_json_file(file):

#Read ORCA MSPack (JSON-like binary format) file
#Returns a dictionary with all information
def read_ORCA_msgpack_file(file):

#Read ORCA BSON (JSON-like binary format) file
#Returns a dictionary with all information
def read_ORCA_bson_file(file):

#Get densities from data dictionary (from read_ORCA_json_file)
def get_densities_from_ORCA_json(data):

#Grab ORCA wfn from jsonfile or data-dictionary. Returns DM_AO,C,S, MO_occs, MO_energies, AO_basis, AO_order
def grab_ORCA_wfn(data=None, jsonfile=None, density=None):

#Reverse JSON to GBW
def create_GBW_from_json_file(jsonfile, orcadir=None):

Warning

Do note that if the GBW-file contains a ROHF wavefunction then this will most likely not work due to the lack of ORCA-JSON handling for ROHF.

Example: grabbing integrals and MOs

Below is an example of how to grab the kinetic energy matrix and the MO-information (MO coefficients, MO-energies)

from ash import *

#Create fragment and ORCA-THeory
frag = Fragment(diatomic="HHe", bondlength=1.3, charge=1, mult=1)
theory = ORCATheory(orcasimpleinput="! RHF STO-3G tightscf")
#Run singlepoint calculation
Singlepoint(theory=theory, fragment=frag)
#Create the JSON-file from the ORCA-created GBW-file, specifying what we want ORCA to print in the JSON-file
jsonfile = create_ORCA_json_file(theory.filename+'.gbw', format="json", basis_set=True, mo_coeffs=True, one_el_integrals=True,
                          two_el_integrals=True)
#Read the JSON-file
data = read_ORCA_json_file(jsonfile)
print("The available objects in the data dictionary:", data.keys())
print("\nT-Matrix:\n")
print(data["T-Matrix"])
print("\nTHe MO information:\n")
print(data["MolecularOrbitals"])

Creating FCIDUMP file from ORCA

The ORCA-JSON functionality can also be utilized to create FCIDUMP files using the function create_ORCA_FCIDUMP.

def create_ORCA_FCIDUMP(gbwfile, header_format="FCIDUMP", filename="FCIDUMP_ORCA", orca_json_format="msgpack",
                        int_threshold=1e-16,  mult=1, full_int_transform=False,
                        convert_UHF_to_ROHF=True):

Examples:

# Create standard FCIDUMP file from ORCA GBW-file
create_ORCA_FCIDUMP("orca.gbw", header_format="FCIDUMP", filename="FCIDUMP_ORCA",
                      int_threshold=1e-16, scf_type="RHF", mult=1)
# Create MRCC-style FCIDUMP-file (fort.55) from ORCA GBW-file
create_ORCA_FCIDUMP("orca.gbw", header_format="MRCC", int_threshold=1e-16, scf_type="RHF", mult=1)

Warning

Do note that if the GBW-file contains a ROHF wavefunction then this will most likely not work due to the lack of ORCA-JSON handling for ROHF.

Warning

If a UHF/UKS WF is found, then this is currently not handled. However, the convert_UHF_to_ROHF keyword can be set to True to make a naive conversion of UHF/UKS to ROHF.

Creating natural-orbitals from a correlated WF density as a Molden-file

It is possible to use the JSON-interface together with some ASH functionality to conveniently get correlated WF densities as Molden-files. The example below shows how we can grab a correlated WFN-density (here a FIC-MRCC density) from a previous ORCA-job, diagonalize the densitry matrix to get natural orbitals and then write the natural orbitals to a Molden-file. This Molden-file can then conveniently be read by Multwfn for example.

from ash import *

# Fragment
fragment=Fragment(xyzfile="FeS2-caspt2.xyz", charge=-1, mult=6)

# GBW-file associated with a previously run ORCA-job:
gbwfile="CASSCF_5_5.gbw"
# Also present in dir: CASSCF_5_5.densities, CASSCF_5_5.densitiesinfo

# Create JSON-file from ORCA-GBW and density-files
jsonfile = create_ORCA_json_file(gbwfile, format="json")
#Read the JSON-file
data = read_ORCA_json_file(jsonfile)

#Get densities from data dictionary (from read_ORCA_json_file)
get_densities_from_ORCA_json(data)

#Grab ORCA density from jsonfile or data-dictionary. Returns DM_AO,C,S, MO_occs, MO_energies, AO_basis, AO_order
DM_AO,C,S, MO_occs, MO_energies, AO_basis, AO_order = grab_ORCA_wfn(jsonfile=jsonfile, density="mult.6.root.0.FIC-MRCC.autoci.p")

#Diagonalize density matrix
natorb, natocc = diagonalize_DM_AO(DM_AO, S)
# Create Molden-file. Note: probably not compatible with Chemcraft, but will work with Multiwfn
make_molden_file(fragment, AO_basis, natorb, MO_energies=None, MO_occs=natocc, AO_order=AO_order,
    label="ASH_orbs", spherical_MOs=True)

Workflow to automate ORCA-orbital creation

ORCA is capable of producing various types of orbitals such as SCF-orbitals (RHF,UHF,ROHF etc.), MP2 natural orbitals, CC natural orbitals, MRCI natural orbitals. The natural orbitals from WFT require a bit of know-how. To automate the creation of these orbitals, ASH features a function called ORCA_orbital_setup.

#Function to prepare ORCA orbitals for another ORCA calculation
def ORCA_orbital_setup(orbitals_option=None, fragment=None, basis=None, basisblock="", extrablock="", extrainput="", label="frag",
        MP2_density=None, MDCI_density=None, memory=10000, numcores=1, charge=None, mult=None, moreadfile=None,
        gtol=2.50e-04, nmin=1.98, nmax=0.02, CAS_nel=None, CAS_norb=None,CASCI=False, natorb_iterations=None,
        FOBO_excitation_options=None, MRCI_natorbiterations=0, MRCI_tsel=1e-6,
        ROHF=False, ROHF_case=None, MP2_nat_step=False, MREOMtype="MR-EOM",
        NMF=False, NMF_sigma=None):

Example on how to get CCSD natural orbitals from an unrelaxed CCSD density:

newmofile, nat_occupations = ORCA_orbital_setup(orbitals_option="CCSD", fragment=frag, label="CCSD"
              basis="def2-SVP", MDCI_density="unrelaxed", charge=0, mult=1)
# Returns name of the MO-file (here called CCSD_orca.mdci.nat)

Useful ORCA functions

In addition to the ORCATheory class, there are a number of built-in functions in ASH that are useful for ORCA functionality. For example functions to grab specific information from an ORCA outputfile etc. To use most these functions, the module has to be loaded first:

from ash.interfaces.interface_ORCA.py import *

Functions for grabbing information from ORCA outputfiles:

#Simple function that grabs elements and coordinates from ORCA outputfile
def grab_coordinates_from_ORCA_output(filename):

#Grab Final single point energy. Ignoring possible encoding errors in file
def ORCAfinalenergygrab(file, errors='ignore'):

#Grab multiple Final single point energies in output. e.g. new_job calculation
def finalenergiesgrab(file):

#Grab SCF energy (non-dispersion corrected)
def scfenergygrab(file):

#Grab HF and correlation energies from ORCA output
def grab_HF_and_corr_energies(file, DLPNO=False, F12=False):

#Grab energies from unrelaxed scan in ORCA (paras block type)
def grabtrajenergies(filename):

#Grab ORCA timings. Return dictionary
def ORCAtimingsgrab(file):

#Grab gradient from ORCA engrad file
def ORCAgradientgrab(engradfile):

#Grab pointcharge gradient from ORCA pcgrad file
def ORCApcgradientgrab(pcgradfile):

#Grab XES state energies and intensities from ORCA output
def xesgrab(file):

#Grab TDDFT state energies from ORCA output
def tddftgrab(file):

#Grab TDDFT state intensities from ORCA output
def tddftintens_grab(file):

#Grab TDDFT orbital pairs from ORCA output
def tddft_orbitalpairs_grab(file):

#Grab molecular orbital energies from ORCA outputfile
def MolecularOrbitalGrab(file):

#Grab QRO energies from ORCA outputfile
def QRO_occ_energies_grab(filename):

#Grab <S**2> expectation values from outputfile
def grab_spin_expect_values_ORCA(file):

#Grab MP2 natural occupations from ORCA outputfile
def MP2_natocc_grab(filename):

#Grab SCF FOD occupations from ORCA outputfile
def SCF_FODocc_grab(filename):

#Grab CASSCF natural occupations from ORCA outputfile
def CASSCF_natocc_grab(filename):

#Find localized orbitals in ORCA outputfile for a given element. Returns orbital indices (to be fed into run_orca_plot)
def orblocfind(outputfile, atomindex_strings=None, popthreshold=0.1):

#Grab spin populations from ORCA outputfile
def grabspinpop_ORCA(chargemodel,outputfile):

#Grab atomic charges from ORCA outputfile
def grabatomcharges_ORCA(chargemodel,outputfile):

#Grab IPs from an EOM-IP calculation and also largest singles amplitudes.
def grabEOMIPs(file):

#Grab electric field gradients from ORCA outputfile
def grab_EFG_from_ORCA_output(filename):

#Grab ICE-WF info from CASSCF job
def ICE_WF_size(filename):

#Grab ICE-WF CFG info from CI job
def ICE_WF_CFG_CI_size(filename):

#Reading stability analysis from output. Returns true if stab-analysis good, otherwise falsee
def check_stability_in_output(file):

Functions related to ORCA Hessian files:

#write ORCA-style Hessian file
def write_ORCA_Hessfile(hessian, coords, elems, masses, hessatoms,outputname):

#Function to grab Hessian from ORCA-Hessian file. Returns 2d Numpy array
def Hessgrab(hessfile):

#Grab coordinates from ORCA-Hessian file. Returns elements and coordinates.
def grabcoordsfromhessfile(hessfile):

#Function to grab masses and elements from an ORCA Hessian file
def masselemgrab(hessfile):

#Read ORCA Hessian-file and return Hessian, elems, coords and masses
def read_ORCA_Hessian(hessfile):

#Grab frequencies from ORCA-Hessian file
def ORCAfrequenciesgrab(hessfile):

Functions for creating ORCA inputfiles:

#Create PC-embedded ORCA inputfile from elems,coords, input, charge, mult,pointcharges
def create_orca_input_pc(name,elems,coords,orcasimpleinput,orcablockinput,charge,mult, Grad=False, extraline='',
                        HSmult=None, atomstoflip=None, Hessian=False, extrabasisatoms=None, extrabasis=None,
                        moreadfile=None, propertyblock=None, fragment_indices=None):

#Create simple ORCA inputfile from elems,coords, input, charge, mult,pointcharges
def create_orca_input_plain(name,elems,coords,orcasimpleinput,orcablockinput,charge,mult, Grad=False, Hessian=False, extraline='',
                            HSmult=None, atomstoflip=None, extrabasis=None, extrabasisatoms=None, moreadfile=None, propertyblock=None,
                            ghostatoms=None, dummyatoms=None,fragment_indices=None):

# Create ORCA pointcharge file based on provided list of elems and coords (MM region elems and coords) and list of point charges of MM atoms
def create_orca_pcfile(name,coords,listofcharges):

# Chargemodel select. Creates ORCA-inputline with appropriate keywords
def chargemodel_select(chargemodel):

Functions for other ORCA functionality:

#Print gradient in ORCA format to disk
def print_gradient_in_ORCAformat(energy,gradient,basename):

Useful ORCA workflows

Examples of useful ways to automate various ORCA calculations.

Plot ORCA-calculated spectra (using orca_mapspc) and normalize

Uses ASH functions: grab_coordinates_from_ORCA_output, run_orca_mapspc, read_datafile, write_datafile

from ash import *
import glob

#Simple ASH script to plot XES spectra from multiple ORCA XES-job outputfiles and normalize w.r.t. to number of absorber elements
absorber_element="Fe"

#orca_mapspc settings
orca_mapspc_option='XESQ'
broadening=1.0
numpoints=5000
start_value=0
end_value=8000
unit='eV'

#Loop over ORCA outputfiles and run orca_mapspc
for outfile in glob.glob("*.out"):
    print("Outfile:", outfile)
    #Get number of absorber elements in molecule from outputfile
    elems,coords = grab_coordinates_from_ORCA_output(outfile)
    elementcount = elems.count(absorber_element)
    print(f"Number of {absorber_element} atoms in file:", elementcount)
    #Get XES .at and .stk files via orca_mapspc
    run_orca_mapspc(outfile, orca_mapspc_option, start=start_value, end=end_value, unit=unit, broadening=broadening, points=numpoints)
    #Read .dat file. Get x and y values as numpy arrays
    x, y = read_datafile(outfile+".xesq.dat")
    #Scale y-values
    scalingfactor=elementcount
    write_datafile(x,y/scalingfactor, filename=outfile+f"_SCALED_by_{scalingfactor}.xesq.dat")
    #Read .stk file
    x, y = read_datafile(outfile+".xesq.stk")
    #Scale y-values
    write_datafile(x,y/scalingfactor, filename=outfile+f"_SCALED_by_{scalingfactor}.xesq.stk")
#