Your advisor wants you to compute ground state energies for the two isomers of 3-(4-nitrophenyl)but-2-en-2-yl triflate. First, run a standard optimization/frequency calculation for the isomers at the M062X/def2svp
level of theory. As always, you can check your work at the code repo.
Do not wait for these calculations to finish. Even running on the cluster, they will take a while. The best thing to do is to submit them and then go set up a reaction or something. If you’re running on the
chem
nodes you can expect these jobs to take around 1 to 2 hours.
1) Specifying different built-in basis sets for different atoms
You take your results back to your advisor who doesn’t seem very satisfied. They tell you to make sure the substituents are right by beefing up the basis set on the heteroatoms.
Modify your calculations to use the triple $\zeta$ basis set def2tzvp
on all heteroatoms and def2svp
on C and H. Give it a shot before checking your input files against those at the code repo.
2) Assigning built-in basis sets to individual atoms
Still unhappy with the results, your advisor tells you to re-run the computations, but this time placing diffuse basis functions on just the nitro group atoms. The basis set def2tzvpd
which has the diffuse functions added to def2tzvp
isn’t built in to Gaussian so you figure that another similarly large basis set with diffuse functions aug-cc-pvtz
that is built-in would work just as well.
Run your computations again, this time use aug-cc-pvtz
to describe the N and only the two O of the nitro group. Keep everything else the same i.e.,
- C H:
def2svp
- S F O(trifyl):
def2tzvp
- N O(nitro):
aug-cc-pvtz
3) Incorporating external basis sets into Gaussian calculations
After a few second guesses, you’re unsure of whether or not aug-cc-pvtz
is really a suitable substitute for def2tzvpd
; you also realize that your computations are taking quite a while, and the other group members are starting to get upset that you’re hogging the new compute nodes. You decide to try using def2tzvpd
instead. Re-run your calculations, this time use def2tzvpd
instead of aug-cc-pvtz
to describe the N and only the two O of the nitro group. Keep everything else the same i.e.,
- C H:
def2svp
- S F O(trifyl):
def2tzvp
- N O(nitro):
def2tzvpd
Remember that def2tzvpd
is not built into g16 so you’ll have to get the basis set from the Basis Set Exchange.$^1$ There are two ways to accomplish this task; see if you can figure them both out before going to the code repo!
Hint: These are the diffuse functions from def2tzvpd
N 0
S 1 1.00
0.68441605847D-01 1.0000000
D 1 1.00
0.12829642058 1.0000000
****
O 0
S 1 1.00
0.70288026270D-01 1.0000000
P 1 1.00
0.51112745706D-01 1.0000000
D 1 1.00
0.14696477366 1.0000000
Key takeaways
Once your jobs have finished, extract the corrected energies from your results. I’ve placed mine in the table below if you’re just following along (the input/output files are available in the code repo).
Using these energies can you justify the product distribution observed in the triflation of 3-(p-nitro)phenyl-2-butanone (products 5g/6g) in this paper?2
Basis Set | Isomer | Energy / kcal mol-1 |
ΔG(E<->Z) / kcal mol-1 |
Total Computation Time / min |
---|---|---|---|---|
def2svp(all) | E Z |
-973863.597 -973862.450 |
1.147 | 37 47 |
def2tzvp (SNOF) def2svp (CH) |
E Z |
-974540.402 -974539.491 |
0.911 | 87 92 |
aug-cc-pvtz (nitro) def2tzvp (trifyl) def2svp (CH) |
E Z |
-974540.579 -974539.693 |
0.886 | 120 134 |
def2tzvpd (nitro) def2tzvp (trifyl) def2svp (CH) |
E Z |
-974542.096 -974541.205 |
0.891 | 99 110 |
There are some key takeaway from the data above:
- The answer never formally changes. In all cases the E isomer, as we expect, is more stable than the Z isomer.
- The caveat is that depending on our choice of basis set, we do see changes in the relative energies of the two species; namely, the relative energies tend converge with increasing basis set size.
- Nevertheless, its important to not lose sight of the forrest in the trees. Look again in the predicted relative energies. In the “worst” case we there is a 1.15 kcal mol-1 difference between the isomers; in the “best” case, only 0.89 kcal mol-1. The difference in these two predictions is a mere 0.25 kcal mol-1; it is simple to use this as justification for more computationally intensive calculations, however consider for a second the experimental implications of this value.
A reaction under control of a 1.15 kcal mol-1 ΔΔG would predict 11% minor product formation, while one with a 0.89 kcal mol-1 ΔΔG would predict 16% of the minor product; barely something to split hairs over. - Computational time, while relatively cheap, is not free. The difference in relative energy that comes from using the diffuse augmented basis sets is a whopping 0.025 kcal mol-1 (or 25 thousandths of a kcal). If this number seems small to you now, consider it in the context of the computational time.
Augmenting just three atoms in our molecule with the diffuse functions ofaug-cc-pvtz
increased our total computational time from 87 min to 120 min, a 40% increase in resources. All for 0.025 kcal mol-1. Realizing that a ΔΔG of 0.025 kcal mol-1 erodes a selectivity by less than 1%, it seems a little silly. Note that DFT scales in cubic time ($\mathcal{O}(n_e^3) $) with respect to the number of electrons in your system so as the size of your molecule increases this issue will only get much worse.3 Take a look at this paper for a discussion on the necessity of diffuse functions.4
Computational chemistry is all about choosing which assumptions to make because all models must make assumptions, i.e., there is no free lunch. In the most rigorous sense we can, we are always searching for the good enough method that balances chemical accuracy with computational cost.
Resources
(1) Basis Set Exchange
(2) Vinyl cations. 12. Mechanism of reaction of cis- and trans-3-phenyl-2-buten-2-yl triflates. Evidence for vinylidene phenonium ions by Peter J. Stang and Thomas E. Dueber J. Am. Chem. Soc. 1977, 99 (8), 2602
(3) Max Hutchinson on CompSci Stack Exchange
(4) Is the Use of Diffuse Functions Essential for the Properly Description of Noncovalent Interactions Involving Anions? by Antonio Bauzá, David Quiñonero, Pere M. Deyà, and Antonio Frontera J. Phys. Chem. A 2013, 117 (12), 2651