a printed or scribed document bearing specialised calibration marks, the document being positioned on a work surface;
a camera focused on the document for generating video signals representing the document in electronic form; and
a processor linked to the camera for processing an image captured by the camera and configured to identify the calibration marks of the document in the captured image and then determine from the location of the calibration marks in the image a transformation between co-ordinates of features in the image and corresponding co-ordinates of features in the document that compensates for the freely variable positioning of the document on the work surface.
Plaque It!
Sponsored by: Flash of Genius |
[0001] The present invention relates to a method and apparatus for facilitating the use of a printed or scribed document, as an interface with a computer.
[0002] Over the decades since electronic computers were first invented, office practices have become dominated by them and information handling is now very heavily based in the electronic domain of the computer. The vast majority of documents are prepared, adapted, stored and even read in electronic form on computer display screens. Furthermore, in parallel to this, the computer interface technology has advanced from there being a predominantly physical interface with the computer using punched cards, keypads or keyboards for data entry—to the extensive present-day reliance on use of cursor moving devices such as the mouse for interacting with the screen-displayed essentially electronic interface known as the Graphical User Interface (GUI) that is in use universally in applications such as Windows®. The Graphical User Interface can be regarded as a virtual interface comprising operator key icons that replace the pushbutton keys of a physical keyboard.
[0003] The drive towards handling documents electronically and also representing hardware computer interfaces in a predominantly electronic form has been relentless since, amongst other obvious benefits, software implementations of hardware occupy no space and may be many orders of magnitude cheaper to produce. Nevertheless, electronic versions of documents and virtual interfaces do not readily suit the ergonomic needs of all users and uses. For some tasks, reading included, paper-based documents are much more user friendly than screen-based documents. Hard copy paper versions of electronic documents are still preferred by many for proof-reading or general reviews, since they are of optimally high resolution and flicker-free and less liable to give the reader eye-strain, for example.
[0004] In recent years the Xerox Corporation have been in the vanguard of developments to better integrate beneficial elements of paper based documents with their electronic counterpart. In particular they have sought to develop interface systems that heighten the level of physical interactivity and make use of computers to enhance paper-based operations.
[0005] Their European patent EP 0,622,722 describes a system in which an original paper document lying on a work surface is monitored by an overhead camera linked to a processor/computer to monitor the user's interaction with text or images on the paper document. An action such as pointing to an area of the paper document can be used to select and manipulate an image taken by the camera of the document and the image or a manipulated form of it is then projected back onto the work surface as a copy or modified copy. The Xerox interactive copying system is suited to this r{circumflex over (
[0006] One desirable type of paper-based interaction would be to print out a list of Internet search results and read through these on paper and to then use the printed paper to select an Internet page to be printed or displayed by simply pointing at the relevant area on the printed page. Aspects of such a use for paper-based computer interaction are further discussed in the present applicant's co-pending UK patent application.
[0007] Existing systems such as the aforementioned Xerox interactive copying system are not suited to such uses.
[0008] According to a first aspect of the present invention there is provided an apparatus which facilitates the use of a printed or scribed document as an interface with a computer and which comprises:
[0009] a said printed or scribed document bearing specialised calibration marks, the document being positioned on a work surface;
[0010] a camera focused on the document for generating video signals representing the document in electronic form; and
[0011] a processor linked to the camera for processing an image captured by the camera and configured to identify the calibration marks of the document in the captured image and then determine from the location of the calibration marks in the image a transformation between co-ordinates of features in the image and corresponding co-ordinates of features in the document that compensates for the freely variable positioning of the document on the work surface.
[0012] a camera focused on the document for generating video signals representing the document; and
[0013] a processor linked to the camera for processing an image captured by the camera and configured to identify the calibration marks of the document in the captured image and to then determine from the location of the calibration marks in the image
[0014] a transformation between co-ordinates of features in the image and co-ordinates of corresponding features in the document.
[0015] The specialised calibration marks on the document and which are recognised by the processor are specialised in the sense that they are distinct and readily distinguishable from the textual or graphic image information that otherwise constitutes the primary information content of the document. An example would be a cross or bulls-eye target as opposed to a letter of the alphabet if the primary information content of the document is text.
[0016] Suitably the processor holds in a memory information concerning the content of the document at specified co-ordinates of the document.
[0017] Preferably the memory holds a reference version electronic image of the document.
[0018] Suitably the document is further marked with a symbol which identifies the document page. This symbol could, at the simplest level, be merely a page number in numerical, letter or alphanumerical form but suitably identifies the wider document context—e.g. which brochure—as well as the number of the specific page. The symbol is preferably a machine readable code, and particularly preferably a bar code such as a two dimensional bar code.
[0019] In the context of marking the document page with a page identification symbol it is further preferred that the memory holds information on the co-ordinates where the page identification symbol is expected to be found on the document, such that once the processor has determined the correspondence between co-ordinates of the image and co-ordinates of the document, the co-ordinates within the image at which the symbol should be found may be processed to more rapidly locate and determine the identity of the document page. This may be of lesser importance where the symbol is in the readily recognisable machine readable form of a bar code but nevertheless speeds up the process of document page recognition.
[0020] Particularly preferably at least three calibration marks are provided on the document, being configured in an L-form if there are only three of them, or in a rectangle if there are four of them.
[0021] Suitably at least one of the calibration marks has the form of a target comprising a plurality of concentric rings or a bulls-eye with at least one concentric ring.
[0022] Particularly advantageously at least one of the calibration marks has a different form from the other calibration mark(s).
[0023] The document page identification symbol is preferably a bar code.
[0024] According to a second aspect of the present invention there is provided a method of using a printed or scribed document as an interface with a computer, which method comprises:
[0025] providing a said printed or scribed document bearing calibration marks, and a camera focused on the document for generating video signals representing in electronic form image information present within the field of view of the camera and a processor linked to the camera;
[0026] capturing an image of the document with the camera; and,
[0027] with the processor, identifying the calibration marks of the document in the captured image and then determining from the location of the calibration marks in the image a transformation between co-ordinates of features in the image and corresponding co-ordinates of features in the document.
[0028] According to a third aspect of the present invention there is provided a printed or scribed document bearing two or more visible and widely spaced apart specialised calibration marks thereon, configured for use in the aforementioned apparatus.
[0029] According to a fourth aspect of the present invention there is provided a method of preparing a printed document bearing specialised calibration marks as aforementioned which method comprises: providing a processor for preparing a printed document linked to a printer and having a memory holding data which dictates the form of the specialised calibration marks; and
[0030] with the processor and printer, printing two or more said calibration marks on to a sheet to produce the printed document.
[0031] Particularly suitably the processor and printer print the calibration marks onto the sheet at substantially the same time as printing the primary information content of the document onto the sheet.
[0032] Advantageously the calibration marks are printed on the sheet in the margins outside of the primary information content of the printed document.
[0033] According to a yet further aspect of the present invention there is provided an apparatus for preparing the aforementioned printed document, wherein the apparatus comprises a processor for preparing a printed document linked to a printer and having a memory holding data which dictates the form of the specialised calibration marks, the processor being configured for printing of the calibration marks together with the primary information content of the printed document.
[0034] A preferred embodiment of the present invention will now be more particularly described, by way of example, with reference to the accompanying drawings, wherein:
[0035]
[0036]
[0037]
[0038]
[0039]
[0040] Referring firstly to
[0041] The document
[0042] The calibration marks
[0043] The illustrated calibration marks
[0044] It is easy to robustly locate such a mark
[0045] Gonzalez R. & Woods R. Digital Image Processing, Addison-Wesley, 1992, pages 443-455; and Rosenfeld A. & Kak A. Digital Picture Processing (second edition), Volume 2, Academic Press, 1982, pages 61-73.
[0046] After thresholding, the pixels that make up each connected black or white region in the image are made explicit using a component labelling technique. Methods for performing connected component labelling/analysis both recursively and serially on a raster by raster basis are described in: Jain R., Kasturi R. & Schunk B. Machine Vision, McGraw-Hill, 1995, pages 4247 and Rosenfeld A. & Kak A. Digital Picture Processing (second edition), Volume 2, Academic Press, 1982, pages 240-250.
[0047] Such methods explicitly replace each component pixel with a unique label.
[0048] Black components and white components can be found through separate applications of a simple component labelling technique. Alternatively it is possible to identify both black and white components independently in a single pass through the image. It is also possible to identify components implicitly as they evolve on a raster by raster basis keeping only statistics associated with the pixels of the individual connected components (this requires extra storage to manage the labelling of each component).
[0049] In either case what is finally required is the centre of gravity of the pixels that make up each component and statistics on its horizontal and vertical extent. Components that are either too large or too small can be eliminated straight off. Of the remainder what we require are those which approximately share the same centre of gravity and for which the ratio of their horizontal and vertical dimensions agrees roughly with those in the calibration mark
[0050] The minimum physical size of the calibration mark
[0051] For the simple case of fronto-parallel (perpendicular) viewing it is only necessary to correctly identify two calibration marks 8 in order to determine the location and orientation of the documents and thereby compensate for the freely variable positioning of the documents. These can also be used to determine the scale of the document. Whereas for a camera
[0052] In the general case the position of two known calibration marks
[0053] Where (X, Y) is a point in the image and (X′, Y′) is the corresponding location on the document (
[0054] With just two identical calibration marks
[0055] Alternatively a third mark
[0056] Where the viewing direction is oblique (allowing the document
[0057] The perspective projection of the planar document
[0058] Where X′=x/w and Y′=y/w.
[0059] Once the transformation has been computed then it can be used for a range of purposes which may include firstly assisting in locating the document page identifier bar code 9 from expected co-ordinates for its location that may be held in a memory in or linked to the computer
[0060] The flow chart of
[0061] As noted above, in the embodiment of
[0062] Importantly, when some documents are printed—for example, HTML web pages—the layout of the printed document is substantially different to that on screen. For web pages the format varies with the parameters of the browser used to view it. By embedding in the markings
[0063] Although in the present invention it is preferred that the printed document
[0064] In preparing the printed document