Category Archives: data

Worldscope data available via Datastream


Thomson Reuter’s Worldscope data are now available via Datastream on selected data workstations along the service area in Watson Library of Business and Economics.

Summary of the Worldscope database from Thomson’s website:

“The Worldscope database offers fundamental data on the world’s leading public and private companies. Widely respected for content quality, depth of detail, extensive company coverage and content presentation, Worldscope provides annual and interim/quarterly data, detailed historical financial statement content, per share data, calculated ratios, pricing and textual information. Worldscope database components include: Full constituent coverage of the major benchmark indexes worldwide; as reported and standardised financial data with dynamic data mapping; detailed business descriptions; business segment geographic sales data; interim financial accounts information; multi-share information; pre-calculated ratios and growth rates; exchange rates; fully adjusted pricing and dividends information, including high, low, close, price/earnings ratio, yield, earnings per share, market value, volume traded & shares outstanding.

Adjustments for corporate actions are also included and comprehensive annual history dates back to 1980.”

Guides/Documents on Worldscope

New database: Morningstar Direct

Morningstar Direct provides a global multi-asset investment analysis platform with Morningstar’s data, investment research, and analytics tools. Includes data on closed-end funds, equity ownership, exchange traded funds, hedge funds, market indexes, money market funds, offshore funds, open-end mutual funds, pension/life insurance funds, stocks, bonds, annuities, college-saving plans, information on Global Private Equity Firm Executives and Owners, and a directory of Global Private Equity Firms.

Access to Morningstar Direct is available on selected business workstations in Watson library.



New database: Preqin Hedge Fund Analyst

Preqin Hedge Fund Analyst “provides all the latest data and intelligence on hedge funds in one place. Information is updated on a daily basis by a team of skilled research analysts based around the globe, providing an extremely reliable and comprehensive resource.”

“Hedge Fund Analyst features include:

Hedge Fund Performance
Fund-by-fund and industry level performance data covering over 6,300 funds, with 4,500 reporting monthly returns, providing you with a true market overview. You can also compare funds to specific segments of the market, including by strategy, fund structure, geography and more, using the fully customizable benchmarks function.

Hedge Fund Profiles
Instantly analyze industry overview statistics of the current fund universe as well as searchable in-depth fund profiles showing all the key information for over 12,000 individual hedge funds (21,000 with share classes).

Hedge Fund Managers
Search over 5,700 fund manager profiles from across the world and view key preferences and personnel contact details. With a subscription to Hedge Fund Investor Profile you can also view the investors in individual funds.

Fund Terms and Conditions
Fund-by-fund and industry level fund terms and conditions data for over 5,500 vehicles provide a market overview, enabling you to see the current trends for specific fund types, structures and investment focuses.”

All of above excerpts are from Preqin website.

Sources for Supply Chain data

Supply chain data reveals the business relationships between companies and allows you to analyze the capital flow between companies on both a customer (revenue) and supplier (cost) basis.  Generally, supply chain data for public companies can be found in a variety of sources such as public filings (including annual and other periodic reports), conference call transcripts, capital markets presentations, sell-side conferences, company press releases, company websites, and industry insiders.

Typically, supply chain data is incomplete due to the fact that SEC Regulation S-K requires companies to disclose only supply-chain relationships for customers that provide revenues greater than 10% on an annual basis.  Additionally, information on international suppliers and customers is difficult to identify.

To find supply chain data, you can use either Bloomberg or Capital IQ to get a holistic view of the supply chain structure for a particular company of interest.

Bloomberg covers supply chain relationship for over 35,000 companies and the data is updated regularly. To display supply chain analysis in Bloomberg, type the mnemonic SPLC. Please note that SPLC function can only be used together with an equity. For example, to find Apple’s supply chain information, type:


You can sort the data by suppliers and customers and display the results in a table format.  For example, you can view the suppliers list like the following:

Similarly, Capital IQ also presents supply chain data under the “business relationships” category.  Additionally, Capital IQ provides the sources for the data.

New database: Pratt’s Guide

The Pratt’s Guide to Private Equity and Venture Capital Sources is a comprehensive directory of private equity and venture capital firms.

From Thomson Reuters:

“The Pratt’s Guide to Private Equity and Venture Capital Sources  the definitive reference source to actively investing private equity and venture capital firms operating around the world. Pratt’s Guide is continually updated with new fund-raising data, new investment and exit data and new contact information.

Pratt’s Guide is substantially researched and managed by a dedicated team specializing in this market. To be listed in Pratt’s, analysts conduct in-depth, independent inquiries of all companies; evaluating in detail their capability, profile and scale of activity. This is supported by a breakdown of their actual investment activity within specific industries and sectors.”


IHS Global Insight


IHS Global Insight has economic forecasts, industry analysis and market intelligence for over 200 countries and 170 industries. With hundreds of expert economists and analysts worldwide, you can tap into our global economic outlook, analysis and data to monitor and assess business opportunities and risks around the world.

Detailed coverage of the U.S. and Global Economies 

  • Detailed analysis of industrial activity, business investment, consumer markets, housing, international trade, labor markets, energy, inflation, financial markets, government policy, and alternative scenarios
  • U.S. regional analysis covering all 50 states, 390 metro areas, and 3,100 counties for 150 economic-performance indicators

Economic data  

  • The largest repositories of global economic, financial, and industry data
  •  Economic data for over 200 countries and over 120 industries worldwide


New database: Markit

Columbia Libraries is pleased to announce that now we have access to Markit via Wharton Research Data Services (WRDS). Columbia Libraries subscribe to Markit services including CDS Pricing (End of Day) and RED-CDS (Full Service).

From the WRDS website:

Markit CDS Pricing: Provides CDS composite and contributor level data on approximately 2,800 individual entities. Applying a data cleansing process, Markit aggregates marks from sell-side contributors to generate its composite price.  The data are provided on a daily basis starting in 2001 (with 810 reference entities) and are available by entity, tier, currency and restructuring clause.

RED (Reference Entity Database):
a database of confirmed Reference Entities and their Reference Obligations for use in trading, documenting, and processing credit derivatives transactions. A Reference Entity is an institution, such as a company, a sovereign, a statute-created entity, or a multilateral institution, that has either issued or guaranteed an obligation (a Reference Obligation). These Reference Obligations are typically issuances of debt obligations, but on occasion they may also be non-debt securities (such as preference shares). Pairs of Reference Entities and their associated Reference Obligations (Pairs) form the basis of RED.  RED provides a means of linking CUSIP and ISIN to other Markit databases (e.g. CDS) which are identified by the Markit primary id (REDCODE).  RED also allows the data user to track CDS reference entities through mergers, de-mergers and other corporate actions.”

Google Ngrams: the power of grammar

Back in December 2010, Google unveiled an online tool for analyzing the history of language and culture as reflected in the gargantuan corpus of historical texts that have been scanned and digitized as part of the Google Books project. They called the interface the Ngram Viewer, and it was launched in conjunction with a blockbuster paper in the journal Science that baptized this Big Data approach to historical analysis with the label “culturomics.”

There are lots of examples out there including a Tumblr feed.

As of today, the Ngram Viewer just got a whole lot better. For starters, the text corpus, already mind-bogglingly big, has become much bigger: The new edition extracts data from more than eight million out of the 20 million books that Google has scanned. That represents about six percent of all books ever published, according to Google’s estimate. The English portion alone contains about half a trillion words, and seven other languages are represented: Spanish, French, German, Russian, Italian, Chinese, and Hebrew…more

Run your own experiment!

Raw data is available for download here.

Info from The, original article available online.

New database: Morningstar Investment Research Center

Morningstar investment research center provides financial data on companies traded on the New York Stock Exchange, NASDAQ®, and the American Stock Exchange; in-depth research reports on companies; data on mutual, closed-end, and exchange-traded funds; analysts’ reports on funds; exclusive investment tools and engaging educational features, including interactive training on basic and advanced stock, fund, and portfolio techniques.

Newsletters section is not part of Columbia subscription.