What’s New in Logos 6

Logos 6 LaunchFaithlife Corporation (formerly known as Logos Bible Software) announced the release of Logos 6 today, the newest version of our flagship product for Bible study. As my colleague Rick Brannan put it in his blog post, “here’s how I’ve spent the last two years”: building the best version ever of this amazing product. So I’m dusting off my blog to talk about what makes Logos 6 so great.

There’s so much to talk about! I plan to create a separate post for each of several new features so they have their own focus.

But to whet your appetite for more, here are a couple of videos that overview some of the new features (and include me and several of my colleagues) from our YouTube channel:

There’s a whole collection here of brief videos on new features and datasets.


BibleTech talk: Automatically Learning Topical Content

I’ve posted the slides from my BibleTech 2013 talk. Here’s the abstract:

Continued work on the Logos Controlled Vocabulary (BibleTech 2010, “A Controlled Vocabulary for Biblical Studies”) has produced a unique collection of topic-aligned content across more than 50 different Bible dictionaries, encyclopedias, and topical indexes in both English and Spanish. This presentation will describe the information we’re learning automatically from this content, including:

  • determining concept importance
  • associating concepts with Bible references
  • extracting and associating names and descriptive terms for concepts
  • relating concepts to each other

You can see the other talks at the BibleTech website. I’ve had a number of positive comments on the talk, which is always gratifying. Slowly but surely, we’re climbing up the data stack …

3/21/2013 update: I’ve used the Slidy framework for presentations for several years because i like the way it puts the whole content out on the web in HTML. However, @JohnRGentry pointed out that my slides don’t work on the iPad because moving forward and backward requires keys. There’s a newer version of Slidy which does support swiping to move through slides, though my experience with it on both Safari and Chrome on iOS hasn’t been great: it’s not easy to register a swipe, and title text sometimes gets lost. I assume these are issues with javascript support on iOS, though i’m not really sure. I’ll try to update my slides to the newest version of Slidy, which will help a little, but i’ll also look for another framework that’s more tablet friendly.

Logos 5, Behind the Curtain: Curation

This post is part of Logos 5, Behind the Curtain, a series of blog posts looking at new data sets that are part of the latest Bible study application from Logos Bible Software.

At the heart of Logos’ approach to data is the practice of curation. If you’ve heard this term at all, it’s probably in the context of museums: but we mean something rather different. As usual, Scott Adams’ Dilbert has the pulse of corporate America:

The Official Dilbert Website featuring Scott Adams Dilbert strips, animations and more

For Logos, though, curation is not just trendy new jargon, but an essential practice that we’ve been pursing for quite a few years now (five under my direction, and more before my tenure). It’s a critical part of what makes Logos unique in its market. For example, we have not one but three Lexical Curators working on the Bible Sense Lexicon (one for Greek, two for Hebrew). To my knowledge (and Google’s), these appear to be the only people in the world with this job title. In fact, most people in the Content Innovation department at Logos are doing one kind of curation task or another.

So what is curation in the context of Bible study software, anyway? In the simplest terms, curation means organizing and maintaining a collection of things. In the museum context, that’s artifacts they display. Our kind of curation involves computer-readable data that captures knowledge relevant to biblical studies. You can summarize it briefly with three key practices:

  1. Collect
  2. Correct
  3. Connect

(“Describe” is probably a more apt term than “Correct”, but i just couldn’t resist the awesome alliteration.)

To “Collect” means imposing organization on a complicated and messy world of information, with an eye toward structuring it and making it useful in some way in the Logos system. Often this information is expressed in prose sentences in a reference book: for example, a Bible dictionary might describe a city, and include language about where the city was located, what larger region it was part of, where it’s mentioned in the Bible, etc. Fundamentally, a plan for collection requires deciding what information ought to be collected, and which things belong in the collection and which ones are excluded (in more technical terms, an ontology).

Capturing and formalizing knowledge typically involves some tradeoffs. First of all, we have to find categories and labels that balance and maximize utility and (formal) correctness. When it comes to categorization in particular, ultimately “everything is miscellaneous” (which is also the title of a really important book by David Weinberger), and if you push hard enough, each thing is its own unique category . But data sets are typically more useful if things are grouped in some way. So we categorize the following as “people” in our data set:

  • individuals (whether named or not)
  • groups, whether defined by residence (Greeks), common ancestry (Levites), belief systems (Pharisees and Sadducees) , etc.
  • supernatural entities (including those that most Bible readers would accept like the God of Israel, and those that biblical authors argue are not real at all, like Baal, Asherah, or Zeus)

To “Correct” or “Describe” means that we choose and populate particular attributes for the items we’re curating. In the case of people, that includes things like names, gender, and roles. In addition, we create three special attributes for nearly every data set:

  • a unique identifier: though you’ll probably know who i mean when i use a biblical name like “David”, you won’t for “John” because there are five different individuals known by that name. This kind of ambiguity, and other variations (“John”, “John the Baptist”, “John Baptizer”, etc.) means names are usually poor identifiers. Instead, we create a symbol like “John.4” that uniquely identifies one particular item in our collection (in this case, a member of the Sanhedrin mentioned in Acts 4:6). Since we don’t show these to users, we don’t have to worry about people understanding them.
  • a label: since our data is for people to look at and understand, we need user-friendly ways to display an entity. Labels are typically brief (less than 20 -25 characters), and also unique, so that in a drop-down list showing names that match “John”, i can distinguish “John (the Baptist)” from “John (Ac 4:6)”. Since the label is also unique, we could use it as the identifier: but since data stability is a primary goal, we separate the two, so that we can change the label if necessary. Since identifiers (not labels) are the means by which we connect data, we (almost) never change them, since that risks breaking the integrity of the data set.
  • a description: labels are brief so they take up minimum space, but consequently, they can’t carry much information. So we often provide a longer prose description, perhaps a sentence or two, that helps identify the entity and its most basic information. You could compare this to the leading sentence in a Wikipedia article. In the case of John.4, that’s “a member of the family of high priests in Jerusalem following Jesus’ ascension.”, which is probably enough to help you decide whether this is a John you want to know more about or not.

To “Connect” means linking entities to other entities (or other data sets). For people, family relationships are an important ways that people connect to each other. We also label those relationship (father, mother, sister), and, for biblical information, capture the textual sources that support this relationship (more technically, the provenance of the data).

Connecting information is one of the most important aspects of curation for Logos. While it may be interesting to learn that King David was also a shepherd, that’s an isolated fact. But if you can get a list of other individuals in the Bible who were also shepherds (or kings, or musicians), now you’re discovering new information. You might not have started out looking for this, or known how to find it for yourself.

Question: which part of Logos’ curation process (Collect, Correct, Connect) do you find the most interesting or appealing? Please leave me a comment.

(Edit: saw a good piece today by John Chambers of Cisco about the power of connection. http://www.wired.com/insights/2012/12/the-internet-of-everything/)

Logos 5, Behind the Curtain: Introduction

I’ve fallen off in my blogging quite a bit over the last few years: my last post (a book review) was December 2011, and, other than conference reports and book reviews, it’s rather sparse for the year or two prior to that.

But i’m excited to be reviving my blog and starting a new series today to celebrate the release of Logos 5. In Bob’s Pritchett’s overview post on the Logos blog, he talks about the importance of connection in Bible study, and says [italics mine]

Logos Bible Software 5 is a significant update that is all about connection.

This focus on connection is not just marketing talk or a conceptual metaphor. It describes in a very concrete way the important new datasets that make Logos 5 a major contribution to the world of biblical studies. I recognize that’s a mighty big claim, but i plan to back it up by describing the vision, architectures, effort, and technical approaches that have made the new features of Logos 5 a reality. In my role as Director of Content Innovation at Logos, i lead a talented and hard-working team of people who have spent the last several years making all these connections. (it wasn’t my idea, but that’s me in the “What’s new in Logos 5” video) Over this series of posts, i hope you’ll gain a better appreciation for what went into these new features, and what makes them so important.

I’d love to hear your comments and questions. For example: which of the new Logos 5 features do you find most useful?

Review: Beautiful Data by Toby Segaran and Jeff Hammerbacher

Segaran (author of the highly-recommended Collective Intelligence) and Hammerbacher have collected together a diverse set of essays on data collection, visualization, processing, and analysis. What interested me most was the wide variety of application areas in which data is the “secret sauce”. The essays range from broadly philosophical to deep in the technical details: so you’re likely to find something at your level of interest (though that also means that much of the book may not hit your level).

Jeff Hammerbacher’s chapter on Information Platforms and the Rise of the Data Scientist is a good example. It discusses Facebook’s history of scaling its data storage and analysis capabilities, starting with custom scripting based on SQL, moving to data warehousing and then beyond to Hadoop and related tools. “More data, simple models” is the processing style that characterizes many such Big Data enterprises today.

Other valuable chapters for me:

  • Data Finds Data (Jeff Jonas and Lisa Sokol)
  • Natural Language Corpus Data (Peter Norvig)
  • Connecting Data (Toby Segaran)

While you’re not likely to find a solution here to specific technical problems, there’s a good chance you’ll find something either to broaden your horizons or give you some new ideas. Definitely recommended.

(Disclosure: I received a free copy of this book through O’Reilly’s Blogger Review program.)

BibleTech Talk Slides: Using the Bible Knowledgebase For Information Integration

Finally got my slides posted from BibleTech:2011 on Using the Bible Knowledgebase for Information Integration. Since i listened to good advice and went a little more toward graphics than bullet points, they’re not completely self-explanatory (but that’s why you should have come, right?).

Audio will show up too at some point, probably at http://www.bibletechconference.com/speakers.

As i’ve told a few of my colleagues since: giving the talk helped convince me even more strongly that Biblical Events will be a really important database for Bible study. Looking forward to getting it all put together.

Seasons and Cycles in your Bible Study

Listening to a podcast by Justin Maxwell for CHI Conversations* raised an interesting question. He was talking about how we all have cycles and mood changes in our lives that affect our interaction with software: the lunch time at the gym, the afternoon doldrums. Based on his previous work with Mint, there were big differences in people’s interactions around paydays, when there’s both a large inflow of money and a lot of bills to be paid. College students tend to break up more frequently before Thanksgiving and Christmas.

Which leads to the question: what are your cycles and moods with respect to Bible study (whether via software or print)? For most of my life i’ve considered morning the optimum time for personal Bible reading: my mind is fresh, and i can take those thoughts with me into the day. Sunday morning (and maybe Wednesday Night Bible Study for the churches that still have them) are obvious times of higher activity. What about Saturday morning compared to weekday mornings: more or less usage? More Bible reading during Lent?

In the digital age, Bible search engines and programs that talk to the cloud have the potential to identify some of these variations. For example, someone associated with one of the large Bible search sites told me they saw a spike in usage in the late hours of Saturday night (pastors preparing their sermons?). We’ve seen some similar upticks in various websites operated by Logos, though i haven’t been able to do a careful analysis.

So i’d be interested to learn more about the cycles, moods, and seasonal fluctuations of Bible reading and study. I’m interested in your own personal reflections, but even more in any studies or data you might be aware of.

*“CHI Conversations covers Computer/Human Interaction, including design, human factors, cognitive psychology, social science, and more.”

Aaron Marshall: User Adoption Strategy

It really changes things when the CEO gets on board with digital literacy. Book by Michael Sampson User Adoption Strategies.

No adoption = no value: you have to plan for adoption. Rogers Bell Curve: perceived utility and ease-of-use matter a lot, which comes back to design. Tip: establish a glossary. “It’s really hard to sit behind someone using your software and not tell them what to do”. “Ideas are cheap, but they still feel like my heart.” “Analytics is the one area I’ve neglected most.” Everything BIG started small. Progressive disclosure: give people a slow introduction to features, don’t overwhelm them up front.

Some interesting sites for augmented reality:

  • stickybits: attach comments to physical objects with barcodes.
  • Greengoose.com: temperature/sound/vibration sensors. Instrumentation of everything.
  • GE smart grid
  • Layar: find people who tweeted nearby, wikipedia articles. You can create your own.

Steven Cummings: Bringing the Power of Search to Mobile

The mobile revolutions means the goal of software now must be to reach the user wherever they are.

BibleReader 5 is their application: showed it on iPad. Originally used EverNote for note synchronization, but wasn’t a good fit. Resource Guide is a new additional to pull in everything in their library that relates to the passage you’re reading. Other library integration around lexicon entries.