E-Z isomers of
3-(4-nitrophenyl)but-2-en-2-yl triflate

Your advisor wants you to compute ground state energies for the two isomers of 3-(4-nitrophenyl)but-2-en-2-yl triflate. First, run a standard optimization/frequency calculation for the isomers at the M062X/def2svp level of theory. As always, you can check your work at the code repo.

Do not wait for these calculations to finish. Even running on the cluster, they will take a while. The best thing to do is to submit them and then go set up a reaction or something. If you’re running on the chem nodes you can expect these jobs to take around 1 to 2 hours.

1) Specifying different built-in basis sets for different atoms

You take your results back to your advisor who doesn’t seem very satisfied. They tell you to make sure the substituents are right by beefing up the basis set on the heteroatoms.

Modify your calculations to use the triple $\zeta$ basis set def2tzvp on all heteroatoms and def2svp on C and H. Give it a shot before checking your input files against those at the code repo.

2) Assigning built-in basis sets to individual atoms

Still unhappy with the results, your advisor tells you to re-run the computations, but this time placing diffuse basis functions on just the nitro group atoms. The basis set def2tzvpd which has the diffuse functions added to def2tzvp isn’t built in to Gaussian so you figure that another similarly large basis set with diffuse functions aug-cc-pvtz that is built-in would work just as well.

Run your computations again, this time use aug-cc-pvtz to describe the N and only the two O of the nitro group. Keep everything else the same i.e.,

3) Incorporating external basis sets into Gaussian calculations

After a few second guesses, you’re unsure of whether or not aug-cc-pvtz is really a suitable substitute for def2tzvpd; you also realize that your computations are taking quite a while, and the other group members are starting to get upset that you’re hogging the new compute nodes. You decide to try using def2tzvpd instead. Re-run your calculations, this time use def2tzvpd instead of aug-cc-pvtz to describe the N and only the two O of the nitro group. Keep everything else the same i.e.,

Remember that def2tzvpd is not built into g16 so you’ll have to get the basis set from the Basis Set Exchange.$^1$ There are two ways to accomplish this task; see if you can figure them both out before going to the code repo!

Hint: These are the diffuse functions from def2tzvpd

N 0
S    1   1.00
      0.68441605847D-01      1.0000000
D    1   1.00
      0.12829642058          1.0000000
****
O 0
S    1   1.00
      0.70288026270D-01      1.0000000
P    1   1.00
      0.51112745706D-01      1.0000000
D    1   1.00
      0.14696477366          1.0000000

Key takeaways

Once your jobs have finished, extract the corrected energies from your results. I’ve placed mine in the table below if you’re just following along (the input/output files are available in the code repo).

Using these energies can you justify the product distribution observed in the triflation of 3-(p-nitro)phenyl-2-butanone (products 5g/6g) in this paper?2

Basis Set Isomer Energy
/ kcal mol-1
ΔG(E<->Z)
/ kcal mol-1
Total Computation Time
/ min
def2svp(all) E
Z
-973863.597
-973862.450
1.147 37
47
def2tzvp (SNOF)
def2svp (CH)
E
Z
-974540.402
-974539.491
0.911 87
92
aug-cc-pvtz (nitro)
def2tzvp (trifyl)
def2svp (CH)
E
Z
-974540.579
-974539.693
0.886 120
134
def2tzvpd (nitro)
def2tzvp (trifyl)
def2svp (CH)
E
Z
-974542.096
-974541.205
0.891 99
110


There are some key takeaway from the data above:

  1. The answer never formally changes. In all cases the E isomer, as we expect, is more stable than the Z isomer.
  2. The caveat is that depending on our choice of basis set, we do see changes in the relative energies of the two species; namely, the relative energies tend converge with increasing basis set size.
  3. Nevertheless, its important to not lose sight of the forrest in the trees. Look again in the predicted relative energies. In the “worst” case we there is a 1.15 kcal mol-1 difference between the isomers; in the “best” case, only 0.89 kcal mol-1. The difference in these two predictions is a mere 0.25 kcal mol-1; it is simple to use this as justification for more computationally intensive calculations, however consider for a second the experimental implications of this value.
    A reaction under control of a 1.15 kcal mol-1 ΔΔG would predict 11% minor product formation, while one with a 0.89 kcal mol-1 ΔΔG would predict 16% of the minor product; barely something to split hairs over.
  4. Computational time, while relatively cheap, is not free. The difference in relative energy that comes from using the diffuse augmented basis sets is a whopping 0.025 kcal mol-1 (or 25 thousandths of a kcal). If this number seems small to you now, consider it in the context of the computational time.
    Augmenting just three atoms in our molecule with the diffuse functions of aug-cc-pvtz increased our total computational time from 87 min to 120 min, a 40% increase in resources. All for 0.025 kcal mol-1. Realizing that a ΔΔG of 0.025 kcal mol-1 erodes a selectivity by less than 1%, it seems a little silly. Note that DFT scales in cubic time ($\mathcal{O}(n_e^3) $) with respect to the number of electrons in your system so as the size of your molecule increases this issue will only get much worse.3 Take a look at this paper for a discussion on the necessity of diffuse functions.4


Computational chemistry is all about choosing which assumptions to make because all models must make assumptions, i.e., there is no free lunch. In the most rigorous sense we can, we are always searching for the good enough method that balances chemical accuracy with computational cost.


Home


Resources

(1) Basis Set Exchange
(2) Vinyl cations. 12. Mechanism of reaction of cis- and trans-3-phenyl-2-buten-2-yl triflates. Evidence for vinylidene phenonium ions by Peter J. Stang and Thomas E. Dueber J. Am. Chem. Soc. 1977, 99 (8), 2602
(3) Max Hutchinson on CompSci Stack Exchange
(4) Is the Use of Diffuse Functions Essential for the Properly Description of Noncovalent Interactions Involving Anions? by Antonio Bauzá, David Quiñonero, Pere M. Deyà, and Antonio Frontera J. Phys. Chem. A 2013, 117 (12), 2651