On the edge of tomorrow

ChatGPT Prompt Frameworks

2024-05-23T14:00:00+00:00

Unlock the full potential of ChatGPT and LLMs. Learn these four simple prompt frameworks to improve the responses from the model.

R-T-F: Role - Task - Format. Act as a ROLE, Create a TASK, Show as FORMAT.

T-A-G: Task - Action - Goal. Define the TASK, State the ACTION, Clarify the GOAL.

B-A-B: Before - After - Bridge. Explain problem BEFORE, State outcome AFTER, Ask the BRIDGE.

C-A-R-E: Context - Action - Result - Example. Give the CONTEXT, Describe the ACTION, Clarify the RESULT, Give the EXAMPLE.
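If you build prompts in code, a framework like R-T-F maps naturally onto a small template function. The sketch below is only an illustration: the helper name `rtf_prompt`, its parameters, and the sample values are hypothetical and not part of any ChatGPT API.

```python
def rtf_prompt(role: str, task: str, fmt: str) -> str:
    """Assemble a Role - Task - Format prompt (hypothetical helper)."""
    return f"Act as {role}. {task} Show the result as {fmt}."


# Sample values are made up for illustration.
prompt = rtf_prompt(
    role="a technical recruiter",
    task="Write a job posting for a junior data analyst.",
    fmt="a bulleted list",
)
print(prompt)
```

The other frameworks could follow the same pattern, with one parameter per letter.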

Eager to use other strategic planning frameworks like SWOT analysis from the business world? That works too.

S-W-O-T: Strengths - Weaknesses - Opportunities - Threats. Analyze STRENGTHS, acknowledge WEAKNESSES, explore OPPORTUNITIES, and consider THREATS.

How do these frameworks work for you? Others to share?

TIL: SORT, UNIQUE, VSTACK

2024-04-03T17:00:00+00:00

How to use three formulas to combine and sort the unique values from two different lists (arrays)

Imagine two very long lists of unique codes (names, id numbers, any unique identifier). You need a single list of the unique codes. There are several approaches but I learned about VSTACK recently, have wanted to use it, and had to look it up again to apply it, so I am writing this as a TIL - today I learned.

Use the two lists to combine (VSTACK) them into a single list of unique values (UNIQUE) that is sorted (SORT).

In the screenshot, the array formula in cell D3 combines these three Excel functions to produce the sorted list of unique alpha codes. Two adjacent Boolean columns give a 1 or a 0 depending on whether the alpha code is from list one or two. ISNUMBER and MATCH are used with the double unary operator (--) to convert TRUE and FALSE into 1’s and 0’s. A value of 1 indicates it was from that list; a value of 0 indicates it was not.

The final column, Source, uses an IF formula and the Boolean columns to indicate where the alpha code appeared–list 1, list 2, or both lists. Careful readers will note that EVER appears in both lists and the IF formula correctly identifies that in column G.

Formulas

D3: = SORT( UNIQUE( VSTACK(A3:A7, B3:B8)))

E3: = --ISNUMBER( MATCH( D3#, $A$3:$A$7, 0))

F3: = --ISNUMBER( MATCH( D3#, $B$3:$B$8, 0))

G3: =IF(AND(E3, F3), "Both", IF(E3, "List 1", "List 2"))
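For readers who think in code, the same combine, de-duplicate, sort, and label logic can be sketched in Python. This is a rough analogue, not a replacement for the Excel formulas. The codes below are made up for illustration (only EVER is known from the post to appear in both lists), and `source` is a hypothetical helper name.

```python
# Made-up codes; list sizes mirror A3:A7 (5 items) and B3:B8 (6 items).
list_1 = ["ECHO", "ALFA", "EVER", "DELT", "BRAV"]
list_2 = ["KILO", "EVER", "GOLF", "HOTL", "FOXT", "INDI"]

# VSTACK -> UNIQUE -> SORT: concatenate, de-duplicate, sort.
combined = sorted(set(list_1 + list_2))


def source(code: str) -> str:
    """Mirror the IF formula in column G (hypothetical helper)."""
    in_1 = code in list_1  # plays the role of --ISNUMBER(MATCH(...)) on list 1
    in_2 = code in list_2  # same check against list 2
    if in_1 and in_2:
        return "Both"
    return "List 1" if in_1 else "List 2"


for code in combined:
    print(code, int(code in list_1), int(code in list_2), source(code))
```

One difference to keep in mind: Excel's SORT ignores case, while Python's `sorted` is case-sensitive, so mixed-case codes may order differently.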

Functions

VSTACK - Appends arrays vertically and in sequence to return a larger array.

SORT - The SORT function sorts the contents of a range or array.

UNIQUE - The UNIQUE function returns a list of unique values in a list or range.

ISNUMBER - Checks whether a specified value is a number (TRUE) or not (FALSE).

MATCH - The MATCH function searches for a specified item in a range of cells, and then returns the relative position of that item in the range.

IF - Makes logical comparisons and can have two results, one if the comparison is TRUE and the other if the comparison is FALSE.

Peeking behind the curtain for International Women’s Day

2024-03-08T19:00:00+00:00

Behind F1’s Velvet Curtain, by Kate Wagner ⇥ web.archive.org

Tagline: If you wanted to turn someone into a socialist you could do it in about an hour by taking them for a spin around the paddock of a Formula 1 race. The kind of money I saw will haunt me forever.

Kate Wagner

It’s 2024. We still do not have equality. We still need to fight for it daily. For this International Women’s Day I want to amplify a voice that was recently suppressed by those with more power. This is a case study in speaking truth to power, holding yourself and each other accountable, and maintaining your convictions viewed through the lens of the failure of the media industry and the deepening class divide.

I have been reading Kate Wagner’s work for several years after discovering her coverage of the Tour de France. Many may know about the viral Twitter account and website, McMansion Hell, where Kate lampoons the subject of gigantic homes. I was definitely already a fan of McMansion Hell when Zillow attempted to bully Kate into shutting down the site by issuing a cease and desist letter. The Electronic Frontier Foundation (EFF) agreed to represent Kate and Zillow backed down. So this week’s drama is clearly not the first time Kate has faced corporate bullying.

Last week Kate Wagner’s excellent piece covering the luxury world of Formula One was published online by Road & Track. Hours later it was pulled with no explanation. The blowback began earlier this week when Will Sommer at the Washington Post published an article investigating why a commissioned, 5,000-word story that was months in the making, and that had drawn widespread praise “because of the unlikely pairing of writer and subject”, would disappear from the Internet without explanation.

Someone in a position of power wants all of this to go away, including Kate – her author profile page on Road & Track briefly went missing this week. Either the sponsors, Mercedes-Benz and INEOS, or the editor-in-chief, Daniel Pund, does not want you to read this article.

So please take a moment to read it. We should be elevating voices like Kate’s, not silencing them. Pay no attention to the man (or sponsors) behind the curtain!

References

A socialist writer skewered the Formula One scene. Then her article vanished., Will Sommer, Washington Post, 2024-03-05.
Road & Track EIC Tries To Explain Why He Deleted An Article About Formula 1 Power Dynamics, Patrick Redford, Defector, 2024-03-05.
Behind F1’s Velvet Curtain, Nick Heer at pxlnv.com.
Kate Wagner’s bio on Road & Track. Don’t be surprised if it 404s.
PDF of Behind F1’s Velvet Curtain, by Kate Wagner just in case the Internet Archive’s version gets “lost”.

Asset management programs face ongoing maintenance deficits

2024-02-24T17:00:00+00:00

“Another flaw in the human character is that everybody wants to build and nobody wants to do maintenance.”

Kurt Vonnegut, Hocus Pocus

Viewed through the lens of total cost of facility ownership, the initial 20 to 30% of lifetime costs for a constructed asset (building, road, bridge, etc.) occur in the first 2 to 4 years of existence, accounting for planning, design, construction, and start-up. What about the remaining 70 to 80% of costs? That share represents the majority of the cost an owner will incur: operating, maintaining, recapitalizing, and disposing. Despite that, every client I have ever worked with faces significant funding shortfalls for maintenance and recapitalization. We celebrate ribbon cuttings; no one celebrates ongoing maintenance.

As consultants, we all probably follow a rigorous quality management plan and quality assurance / quality control processes but we are all human. Mistakes happen. A colleague recently asked if I worried about making mistakes in deliverables. I said I did not because in our line of consulting, no one will die from one of those mistakes. That was not always the case when I was a consultant specializing in bridge design & inspection.

This scenario used to give me nightmares – a bridge I had inspected collapsing sometime after the inspection. I cannot tell you how many bridges my teams inspected over the years with significant section loss in structural members. We closed major interstates and lanes of major bridges. As of 2021, the American Society of Civil Engineers (aka ASCE) gave America’s infrastructure an overall score of C- and bridges a C.

NTSB shares probable cause of Fern Hollow Bridge collapse

This week, an engineering friend pointed me to the NTSB’s findings on the collapse of the Fern Hollow Bridge in Pittsburgh:

“The collapse began when the transverse tie plate on the southwest bridge leg failed due to extensive corrosion and section loss. The corrosion and section loss resulted from clogged drains that caused water to run down bridge legs and accumulate along with debris at the bottom of the legs, which prevented the development of a protective rust layer or patina. Although repeated maintenance and repair recommendations were documented in many inspection reports, the City of Pittsburgh (City) failed to act on them, leading to the deterioration of the fracture-critical transverse tie plate and the structural failure of the bridge.”

…

Although maintenance and repair recommendations were repeatedly made in the bridge inspection reports, the City failed to act on several of these recommendations, which led to progressive deterioration and the collapse of the bridge.

There’s a YouTube animation of the collapse by the NTSB:

https://youtu.be/J-VnWB4fiFk&t=195

The collapse animation begins at 3:15 of the video. The first 3 minutes provide a detailed overview of the bridge and the situation. In this disaster, the inspectors were not to blame – they had identified the problem in reports over and over for years.

NTSB recommendations: There are almost too many recommendations to include in one summary, but here are the big ones (in the opinion of the author of the original article):

To FHWA: Require a one-time review of the existing fracture-critical member (nonredundant steel tension member) inspection plans for bridges with nonredundant steel frame leg designs in its inventory.

To PennDOT: Develop and implement a plan to publish yearly aggregate data on bridge maintenance and repair recommendations.

To AASHTO: Update the Manual for Bridge Evaluation to include guidance that addresses the identification of localized tension zones and tension components in nonredundant steel members that are generally considered to be fully or partially in compression.

Lack of maintenance funding is a serious problem

The city of Pittsburgh, in the aftermath of the Fern Hollow Bridge collapse, stood up a Bridge Asset Management Program. Similar to any agency managing a large portfolio of constructed assets, the city is extremely underfunded for maintenance & repair. The Bridge Asset Management Program Overview indicates annual routine maintenance needs of $9.75M for activities like cleaning & washing, deck joint repair/replacement, painting, crack sealing & patching, and deck overlays across an inventory of 99 vehicular and 47 pedestrian bridges.

Table 1: 2021 - 2024 Annual Maintenance funding, Pittsburgh Bridge Program

year	funding
2021	$0.39M
2022	$0.75M
2023	$0.95M
2024	$1.05M

The city faces an annual routine maintenance shortfall of $8.7M, or 8.3 times current available funding. Another report by the consultant, WSP, showed that city bridges will need nearly $500 million in improvements over the next 32 years, or about $15.6M per year.
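The arithmetic behind those figures is easy to check. The sketch below uses only the numbers quoted in this post; the variable names are my own, and the shortfall is assumed to be measured against the 2024 funding level, which is what reproduces the $8.7M figure.

```python
annual_need_m = 9.75  # routine maintenance need, $M per year
funding_m = {2021: 0.39, 2022: 0.75, 2023: 0.95, 2024: 1.05}  # Table 1, $M

# Shortfall measured against the most recent (2024) funding level.
shortfall_m = annual_need_m - funding_m[2024]
ratio = shortfall_m / funding_m[2024]

# Long-term need from the WSP report: about $500M over 32 years.
long_term_per_year_m = 500 / 32

print(f"Shortfall: ${shortfall_m:.2f}M ({ratio:.1f}x current funding)")
print(f"Long-term need: ${long_term_per_year_m:.1f}M per year")
```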

My favorite part from the full article in the Pittsburgh Union Press (published by the striking workers at the Pittsburgh Post-Gazette, but that’s another story in itself) is the ending where the board chair, Jennifer Homendy, leveraged the quote that started this post by pointing out the paradox that problems cited in decades worth of inspection reports were ignored but the city built a new bridge replacement in less than a year:

She referred to a line in famed author Kurt Vonnegut’s 1990 novel, “Hocus Pocus,” that read: “Another flaw in the human character is that everybody wants to build and nobody wants to do maintenance.”

We need to figure out how to fund long-term maintenance needs for the nation’s infrastructure. Until we do, tragedies like the Fern Hollow Bridge collapse, and worse, will continue. Recent laws like the Bipartisan Infrastructure Law (BIL) and the Great American Outdoors Act (GAOA) are band-aids on the wound unless long-term funding to close the ongoing maintenance gap is identified.

Full article

NTSB cites Pittsburgh, state, federal failures in Fern Hollow Bridge collapse, Pittsburgh Union Press:

https://www.unionprogress.com/2024/02/21/ntsb-cites-pittsburgh-state-federal-failures-in-fern-hollow-collapse/

Reference

Whole Building Design Guide - Operation & Maintenance Planning: As Figure 1 illustrates, 80% of a facility’s life-cycle costs are associated with Operation & Maintenance (O&M).

How to write ChatGPT prompts 5x better using Markdown

2024-02-17T17:00:00+00:00

Want to improve your LLM performance? Get 5x better ChatGPT performance with these simple #hacks.

I’ve seen a lot of tips around the Internet about how to improve prompts. Here are some simple ones that rely on nothing more than some Markdown.

Use Headers to organize content

Use # for main headings.
Use ##, ###, etc., for subheadings.

# Main Heading

## Subheading

### Sub-subheading

Emphasize text for clarity

Use **text** for bold.
Use *text* or _text_ for italic.

I am **strongly** emphasizing this.

This is an _important_ note.

Create lists for better readability

Use - or * for unordered lists.
Use 1., 2., etc., for ordered lists

- First item
- Second item
- Third item

1. First step
2. Second step
3. Third step

Include reference links

Use [text](URL) to include a link.

Visit [OpenAI](https://openai.com)

Use code blocks for technical content

Use code blocks that begin and end with triple back ticks ```.

```

# This is a code block
print("Hello")

```

And one more example:

```Also a code block```

But a single line, command, or function can be done with single back ticks:

`print("this is a line of code")`

Use block quotes for citations

Use > for block quotes

Your quote goes here:

> Now is the time for all good men...

Add images

Use ![alt text](image url) for images

![This is an image](https://example.com/image.jpg)
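If you send prompts from a script rather than typing them, the same Markdown structure can be assembled programmatically. The following is a minimal sketch, assuming a hypothetical helper named `markdown_prompt`; the section names and sample text are made up, and nothing here calls an actual LLM API.

```python
def markdown_prompt(title, context, steps, quote=None):
    """Build a Markdown-formatted prompt string (hypothetical helper)."""
    lines = [f"# {title}", "", "## Context", "", context, "", "## Steps", ""]
    # Ordered list: 1., 2., 3., ...
    lines += [f"{i}. {step}" for i, step in enumerate(steps, start=1)]
    if quote:
        # Block quote for cited material.
        lines += ["", f"> {quote}"]
    return "\n".join(lines)


prompt = markdown_prompt(
    title="Summarize a report",
    context="The report is **confidential**; keep names out of the summary.",
    steps=["Read the report", "List the key findings", "Write a short summary"],
)
print(prompt)
```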

Which fonts?

2023-09-18T14:00:00+00:00

Which fonts to use for your charts and tables - Datawrapper Blog

I’ve been reading Lisa Charlotte Muth’s writing on Datawrapper and elsewhere for a while and recently stumbled on this post again. I like that instead of just listing the top X things you should do for Y, she explains the why behind each tip and pairs less-than-ideal solutions with better ones, along with plenty of examples from mainstream sites.

Next time you are working on a presentation with graphs and tables, keep these tips in mind.

Chart recreations–iPhone Success

2022-11-22T21:00:00+00:00

iPhone more successful than all other Apple products

I used to have an entire series of graph recreations but they’ve been lost to bit rot. I was reading another of Lisa Charlotte Rost’s posts over on Datawrapper, Better Charts, and wanted to try my hand at recreating the final iPhone graph in Tableau. I was mostly interested in seeing if I could reproduce it as closely as possible, and if I could get the highlighted time period in there.

Here’s my take on Tableau Public:

https://public.tableau.com/app/profile/scott.prestridge/viz/datawrapper-iPhone-public/iPhoneSuccessful

I think I nailed it.

Masking up

2020-10-30T22:00:00+00:00

There are fewer COVID-19 cases reported in states with higher rates of mask use. Both the analysis and the data visualization make a powerful argument for wearing a mask.

I came across this data and wanted the opportunity to practice my Excel visualization skills as well as my Python skills in applying a linear regression model to a data set. Linear regression is one of the family of algorithms used in supervised machine learning (ML) tasks. Supervised ML tasks are generally divided into classification and regression - with linear regression in the latter category. I am trying to predict a continuous number rather than a class or category. In this example, I am trying to predict the prevalence of COVID-19 cases by knowing how often people wear masks.

Regression tasks split into two main groups:

One feature is used to predict the target (simple linear regression)
More than one feature is used to predict the target (multiple linear regression)

Since this example is using one feature (how often people wear masks) to predict the target (prevalence of COVID-19 in the community) the task falls into simple linear regression.

For all 50 states plus the District of Columbia (D.C.), the chart below plots the percentage of state residents who say they wear a mask in public all or most of the time (on the horizontal axis) and the percentage who say they know someone in their community with virus symptoms (on the vertical axis).

The r-squared of the CovidCast mask and symptom data is 0.73, meaning that you can predict about 73 percent of the variability in state-level COVID-19 symptom prevalence by knowing how often people wear their masks.

Yes, correlation is not causation. Certainly there are differences between the states beyond the use of masks. People in rural places may spend less time close to others so may feel less of a need to wear a mask. Many states with high mask usage had major outbreaks earlier in the pandemic - so mask wearing may be more common in these states. Nevertheless, this data is interesting to data scientists and could be useful to epidemiological researchers interested in studying the public reaction to the pandemic and its spread.

The above was created quickly in Excel. I also wanted to practice my Python and see if I could recreate the analysis and data viz in code.

Using pandas, I read the CSV data into a dataframe called mask_up and assigned the wearing a mask data field to the x-values and the COVID-19 symptoms data field to the y-values and ran an ordinary-least-squares (OLS) regression. Once that was done it was easy to calculate the r-squared value:

r2 = r2_score(y, linefitline(x))
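For anyone curious what the fit and the r-squared calculation do under the hood, here is a dependency-free sketch of simple linear regression. It is not the script used for the chart: the function names are my own, and the data points are made up for illustration rather than taken from the CovidCast data.

```python
def fit_line(x, y):
    """Ordinary least squares with one feature: returns (slope, intercept)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def r_squared(y, y_pred):
    """Share of the variance in y explained by the predictions."""
    mean_y = sum(y) / len(y)
    ss_res = sum((yi - pi) ** 2 for yi, pi in zip(y, y_pred))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return 1 - ss_res / ss_tot


# Made-up values for illustration only (NOT the CovidCast data).
mask_pct = [70.0, 75.0, 80.0, 85.0, 90.0]
symptom_pct = [40.0, 34.0, 31.0, 24.0, 21.0]

slope, intercept = fit_line(mask_pct, symptom_pct)
predicted = [slope * xi + intercept for xi in mask_pct]
r2 = r_squared(symptom_pct, predicted)
print(f"slope={slope:.2f}, intercept={intercept:.1f}, r-squared={r2:.1%}")
```

A negative slope means the predicted symptom share falls as the mask-wearing share rises, which is the pattern the chart shows.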

The plot took a bit of experimentation to get the settings and formats the way that I wanted. I’m still not happy with the linestyle but it will do for now. See below.

Wear your masks!

Here’s the code used to produce the graph above:

import matplotlib.pyplot as plt

# plot the fitted regression line
plt.plot(x, line1, color='b', linestyle='-')

# plot the scatter once, then label each point with its state
plt.scatter(x, y)
for i, txt in enumerate(mask_up['state']):
    plt.annotate(txt, (x[i], y[i]), xytext=(0, 0), textcoords='offset points')

# labels
plt.xlabel('Percentage of people wearing a mask in public all or most of the time')
plt.ylabel('Percentage of people who know someone \n w/ COVID-19 symptoms')
plt.text(68, 42, 'r-squared = {:.1%}'.format(r2))

plt.title('Mask up: Fewer COVID-19 symptoms reported in states \n with higher rates of mask use.', loc='left')

# grid
xmin = 60
xmax = 100
plt.xlim(xmin, xmax)

ymin = 10
ymax = 50
plt.ylim(ymin, ymax)

plt.show()

Producing Small Multiples of COVID-19 case rates by state

2020-06-05T22:00:00+00:00

Inspired by my love for small multiples, Horace Dediu’s recent work on Asymco, and the pandemic the world is experiencing I decided to put my Python skills to test and produce a small multiples plot of case rates for the United States.

I’m pleased with the result but already see things I would like to incorporate for v2.0.

If you’d like to use the script yourself I have made it available online. The plots are all COVID-19 case rates starting once a state reaches 30 cases/day. All data is a 7-day average and the source is the NY Times COVID data from GitHub.
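The two processing steps described above, smoothing with a 7-day average and starting each state's series once it reaches 30 cases/day, can be sketched in a few lines. This is an illustration rather than the published script: the function names are hypothetical, the input is assumed to be daily new cases for one state, the threshold is assumed to apply to the smoothed series, and the sample numbers are made up.

```python
def rolling_mean(values, window=7):
    """Trailing moving average; the first window-1 days have no value."""
    return [
        sum(values[i - window + 1 : i + 1]) / window
        for i in range(window - 1, len(values))
    ]


def trim_to_threshold(series, threshold=30):
    """Drop leading values until the series first reaches the threshold."""
    for i, value in enumerate(series):
        if value >= threshold:
            return series[i:]
    return []


# Made-up daily new cases for one state.
daily_cases = [0, 7, 14, 21, 28, 35, 42, 49, 56, 63]

smoothed = rolling_mean(daily_cases)
aligned = trim_to_threshold(smoothed)
print(smoothed)  # 7-day averages
print(aligned)   # series starting at the first day with >= 30 cases/day
```

Aligning every state to the same starting threshold is what makes the small multiples comparable, since each panel then begins at a similar point in its outbreak.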

What trends do you see?

Where is Excel’s startup folder (XLStart)?

2020-01-30T14:00:00+00:00

Here’s a quick tip on using Excel and the Visual Basic Editor (VBE) to determine the location of Excel’s startup folder. Any workbook in this location is opened automatically every time you launch Excel. This is where Excel stores personal workbooks, like personal.xlsb, so they load every time you use Excel. I think Excel will also store customized workbook templates, Book.xltx, here.

Since the location is rarely accessed it can be difficult to remember where it is located. The quickest way to locate it is to use the Immediate window in the VBE.

Press Alt-F11 to launch the VBE.
If the Immediate window isn’t visible, press Ctrl-g or use the View menu to open it.
In the Immediate window, type ? application.StartupPath and press Enter. VBA will display the path to XLStart.

You can copy and paste the path into File Explorer or a terminal prompt to navigate to the location.