TY - GEN AU - Yu, Hsiang-Fu AU - Jain, Prateek AU - Kar, Purushottam AU - Dhillon, Inderjit S. A2 - T1 - Large-scale Multi-label Learning with Missing Labels JO - PB - C1 - PY - 2013/ VL - IS - SP - EP - UR - http://arxiv.org/abs/1307.5101 DO - KW - classification KW - kallimachos KW - label KW - large KW - learning KW - multi L1 - N1 - Large-scale Multi-label Learning with Missing Labels N1 - AB - The multi-label classification problem has generated significant interest in

recent years. However, existing approaches do not adequately address two key

challenges: (a) the ability to tackle problems with a large number (say

millions) of labels, and (b) the ability to handle data with missing labels. In

this paper, we directly address both these problems by studying the multi-label

problem in a generic empirical risk minimization (ERM) framework. Our

framework, despite being simple, is surprisingly able to encompass several

recent label-compression based methods which can be derived as special cases of

our method. To optimize the ERM problem, we develop techniques that exploit the

structure of specific loss functions - such as the squared loss function - to

offer efficient algorithms. We further show that our learning framework admits

formal excess risk bounds even in the presence of missing labels. Our risk

bounds are tight and demonstrate better generalization performance for low-rank

promoting trace-norm regularization when compared to (rank insensitive)

Frobenius norm regularization. Finally, we present extensive empirical results

on a variety of benchmark datasets and show that our methods perform

significantly better than existing label compression based methods and can

scale up to very large datasets such as the Wikipedia dataset. ER - TY - JOUR AU - Binder, M. AU - Hibbett, D. S. AU - Larsson, K. H. AU - Larsson, E. AU - Langer, E. AU - Langer, G. T1 - The phylogenetic distribution of resupinate forms across the major clades of mushroom-forming fungi (Homobasidiomycetes) JO - Systematics and Biodiversity PY - 2005/06 VL - 3 IS - 2 SP - 113 EP - 157 UR - /brokenurl#://000231684600001 DO - KW - DNA-sequences KW - community KW - corticiaceae KW - corticioid KW - ectomycorrhizal KW - fungi KW - heterobasidiomycetes KW - large KW - mycobionts KW - nuclear KW - parsimony KW - piriformospora-indica KW - polyporaceae KW - ratchet KW - rdna KW - ribosomal KW - sebacinoid KW - septal KW - sequences KW - significance KW - status KW - structure KW - subunit KW - systematics KW - taxonomic KW - taxonomy KW - tylospora-fibrillosa KW - ultrastructure L1 - SN - N1 - N1 - AB - Phylogenetic relationships of resupinate Homobasidiomycetes (Corticiaceae s. lat. and others) were studied using ribosomal DNA (rDNA) sequences from a broad sample of resupinate and nonresupinate taxa. Two datasets were analysed using parsimony, a'core'dataset of 142 species, each of which is represented by four rDNA regions (mitochondrial and nuclear large and small subunits), and a 'full' clataset of 656 species, most of which were represented only by nuclear large subunit rDNA sequences. Both datasets were analysed using traditional heuristic methods with bootstrapping, and the full clataset was also analysed with the Parsimony Ratchet, using equal character weights and six-parameter weighted parsimony. Analyses of both datasets supported monophyly of the eight major clades of Homobasicliomycetes recognised by Hibbett and Thorn, as well as independent lineages corresponding to the Gloeophyllum clade, corticioid clade and jaapia argillacea. Analyses of the full clataset resolved two additional groups, the athelioid clade and trechisporoid clade (the latter may be nested in the polyporoid clade). Thus, there are at least 12 independent clades of Homobasicliomycetes. Higher-level relationships among the major clades are not resolved with confidence. Nevertheless, the euagarics clade, bolete clade, athelioid clade and jaapia argillacea are consistently resolved as a monophyletic group, whereas the cantharelloid clade, gomphoid-phalloid clade and hymenochaetoid clade are placed at the base of the Homobasidiomycetes, which is consistent with the preponderance of imperforate parenthesomes in those groups. Resupinate forms occur in each of the major clades of Homobasidiomycetes, some of which are composed mostly or exclusively of resupinate forms (athelioid clade, corticioid clade, trechisporoid clade,jaapia). The largest concentrations of resupinate forms occur in the polyporoid clade, russuloid clade and hymenochaetoid clade. The cantharelloid clade also includes many resupinate forms, including some that have traditionally been regarded as heterobasidiomycetes (Sebacinaceae, Tulasnellates, Ceratobasidiales). The euagarics clade, which is by far the largest clade in the Homobasidiomycetes, has the smallest fraction of resupinate species. Results of the present study are compared with recent phylogenetic analyses, and a table summarising the phylogenetic distribution of resupinate taxa is presented, as well as notes on the ecology of resupinate forms and related Homobasidiomycetes. ER - TY - JOUR AU - Clauset, Aaron AU - Newman, M. E. J. AU - Moore, Cristopher T1 - Finding community structure in very large networks JO - Physical Review E PY - 2004/ VL - 70 IS - SP - EP - UR - http://www.citebase.org/abstract?id=oai:arXiv.org:cond-mat/0408187 DO - KW - clustering KW - community KW - large KW - networks KW - toread L1 - SN - N1 - [cond-mat/0408187] Finding community structure in very large networks N1 - AB - ER - TY - JOUR AU - Clauset, Aaron AU - Newman, M.E.J. AU - Moore, Cristopher T1 - Finding community structure in very large networks JO - Physical Review E PY - 2004/ VL - 70 IS - SP - EP - UR - http://www.citebase.org/cgi-bin/citations?id=oai:arXiv.org:cond-mat/0408187 DO - KW - community KW - detection KW - gn KW - large KW - network KW - newman KW - structure L1 - SN - N1 - N1 - AB - ER - TY - GEN AU - Clauset, Aaron AU - Newman, M. E. J. AU - Moore, Cristopher A2 - T1 - Finding community structure in very large networks JO - PB - C1 - PY - 2004/08 VL - IS - SP - EP - UR - http://arxiv.org/abs/cond-mat/0408187 DO - KW - clustering KW - large KW - network KW - community L1 - N1 - N1 - AB - The discovery and analysis of community structure in networks is a topic of

considerable recent interest within the physics community, but most methods

proposed so far are unsuitable for very large networks because of their

computational cost. Here we present a hierarchical agglomeration algorithm for

detecting community structure which is faster than many competing algorithms:

its running time on a network with n vertices and m edges is O(m d log n) where

d is the depth of the dendrogram describing the community structure. Many

real-world networks are sparse and hierarchical, with m ~ n and d ~ log n, in

which case our algorithm runs in essentially linear time, O(n log^2 n). As an

example of the application of this algorithm we use it to analyze a network of

items for sale on the web-site of a large online retailer, items in the network

being linked if they are frequently purchased by the same buyer. The network

has more than 400,000 vertices and 2 million edges. We show that our algorithm

can extract meaningful communities from this network, revealing large-scale

patterns present in the purchasing habits of customers. ER - TY - CONF AU - Kubica, Jeremy AU - Moore, Andrew AU - Schneider, Jeff A2 - Wu, Xindong A2 - Tuzhilin, Alex A2 - Shavlik, Jude T1 - Tractable Group Detection on Large Link Data Sets T2 - The Third IEEE International Conference on Data Mining PB - IEEE Computer Society C1 - PY - 2003/november CY - VL - IS - SP - 573 EP - 576 UR - DO - KW - large KW - detection KW - community KW - network KW - gda L1 - SN - N1 - N1 - AB - ER - TY - RPRT AU - Kubica, Jeremy Martin AU - Moore, Andrew AU - Schneider, Jeff A2 - T1 - K-groups: Tractable Group Detection on Large Link Data Sets PB - Robotics Institute, Carnegie Mellon University AD - Pittsburgh, PA PY - 2003/10 VL - IS - CMU-RI-TR-03-32 SP - EP - UR - http://www.ri.cmu.edu/pubs/pub_4489.html DO - KW - large KW - detection KW - gda KW - network KW - community L1 - N1 - N1 - N1 - AB - Discovering underlying structure from co-occurrence data is an important task in many fields, including: insurance, intelligence, criminal investigation, epidemiology, human resources, and marketing. For example a store may wish to identify underlying sets of items purchased together or a human resources department may wish to identify groups of employees that collaborate with each other.

Previously Kubica et. al. presented the group detection algorithm (GDA) - an algorithm for finding underlying groupings of entities from co-occurrence data. This algorithm is based on a probabilistic generative model and produces coherent groups that are consistent with prior knowledge. Unfortunately, the optimization used in GDA is slow, making it potentially infeasible for many real world data sets.

To this end, we present k-groups - an algorithm that uses an approach similar to that of k-means (hard clustering and localized updates) to significantly accelerate the discovery of the underlying groups while retaining GDA's probabilistic model. In addition, we show that k-groups is guaranteed to converge to a local minimum. We also compare the performance of GDA and k-groups on several real world and artificial data sets, showing that k-groups' sacrifice in solution quality is significantly offset by its increase in speed. This trade-off makes group detection tractable on significantly larger data sets. ER - TY - BOOK AU - Cullum, J.K. AU - Willoughby, R.A. A2 - T1 - Lanczos algorithms for large symmetric eigenvalue computations: Theory PB - Society for Industrial Mathematics C1 - PY - 2002/ VL - IS - SP - EP - UR - http://scholar.google.de/scholar.bib?q=info:zshJq2GVHO8J:scholar.google.com/&output=citation&hl=de&ct=citation&cd=0 DO - KW - algorithms KW - eigenvalue KW - large KW - symmetric L1 - SN - N1 - N1 - AB - ER - TY - CONF AU - Wang, Haixun AU - 0010, Wei Wang AU - Yang, Jiong AU - Yu, Philip S. A2 - Franklin, Michael J. A2 - Moon, Bongki A2 - Ailamaki, Anastassia T1 - Clustering by pattern similarity in large data sets. T2 - SIGMOD Conference PB - ACM C1 - PY - 2002/ CY - VL - IS - SP - 394 EP - 405 UR - http://dblp.uni-trier.de/db/conf/sigmod/sigmod2002.html#WangWYY02 DO - KW - algorithm KW - clustering KW - data KW - fca? KW - large KW - pClusters KW - paper KW - pattern L1 - SN - 1-58113-497-5 N1 - dblp N1 - AB - ER - TY - CONF AU - Hovy, E.H. A2 - T1 - Combining and Standardizing Large-Scale, Practical Ontologies for Machine Translation and Other Uses T2 - Proc. 1st Intl. Conf. on Language Resources and Evaluation (LREC) PB - C1 - Granada PY - 1998/ CY - VL - IS - SP - EP - UR - http://www.isi.edu/natural-language/people/hovy/publications.html DO - KW - ontology KW - large KW - scale KW - application KW - practical L1 - SN - 3-540-41066-X N1 - N1 - AB - ER - TY - CONF AU - Zhang, Tian AU - Ramakrishnan, Raghu AU - Livny, Miron A2 - T1 - BIRCH: an efficient data clustering method for very large databases T2 - Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data (SIGMOD'96) PB - C1 - PY - 1996/ CY - VL - IS - SP - 103 EP - 114 UR - http://citeseer.ist.psu.edu/zhang96birch.html DO - KW - birch KW - clustering KW - database KW - large KW - tree L1 - SN - N1 - N1 - AB - ER - TY - JOUR AU - Berry, M. AU - Do, T. AU - O’Brien, G. AU - Krishna, V. AU - Varadhan, S. T1 - SVDPACKC (version 1.0) user's guide JO - University of Tennessee PY - 1993/ VL - IS - SP - EP - UR - http://scholar.google.de/scholar.bib?q=info:Y2fs0GQQ_LIJ:scholar.google.com/&output=citation&hl=de&ct=citation&cd=0 DO - KW - large KW - lib KW - scale KW - sparse KW - svd L1 - SN - N1 - N1 - AB - ER - TY - JOUR AU - Sherman, Andrew H T1 - Algorithms for sparse Gaussian elimination with partial pivoting JO - ACM Trans. Math. Softw. PY - 1978/ VL - 4 IS - 4 SP - 330 EP - 338 UR - http://portal.acm.org/citation.cfm?id=356494 DO - http://doi.acm.org/10.1145/356502.356494 KW - large KW - math KW - matrix KW - scale KW - svd L1 - SN - N1 - N1 - AB -

"http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">

Algorithms for sparse Gaussian elimination with partial pivoting

ACM Home Page

Please provide us with feedback. Feedback

Algorithms for sparse Gaussian elimination with partial pivoting
Full text

PdfPdf

(564 KB)


Source

ACM Transactions on Mathematical Software (TOMS)

archive

Volume 4 ,  Issue 4  (December 1978)

table of contents

Pages: 330 - 338  

Year of Publication: 1978

ISSN:0098-3500

Author

Andrew H Sherman

 Department of Computer Sciences, Painter 328, The University of Texas at Austin, Austin, TX

Publisher

ACM 

New York, NY, USA

Bibliometrics

Downloads (6 Weeks): 2,   Downloads (12 Months): 58,   Citation Count: 2

Additional Information:

references  

cited by  

index terms  

collaborative colleagues  

peer to peer  

Tools and Actions:

Request Permissions Request Permissions   

Review this Article  

Save this Article to a Binder

  

Display Formats:

BibTeX 

EndNote

ACM Ref

  

DOI Bookmark:

Use this link to bookmark this Article: http://doi.acm.org/10.1145/356502.356494
What is a DOI?


REFERENCES

 

1

 

2

CURTIS, A.R., AND REID, J.K. FORTRAN subroutines for the solution of sparse sets of hnear equations. AERE Rep. R6844, AERE HarweU, England, 1971.

 

3

CURTIS, A.R, AND REID, J.K The solution of large sparse unsymmetnc systems of hnear equations. Information Processing '71 (1971), 1240-1245.

 

4

DUFF, I.S., AND REID, J.K A comparison of sparsity ordermgs for obtaining a pivotal sequence in Gausslan elm~matlon. AERE Rep. T.P. 526, AERE Harwell, England, 1973

5

 

6

FORSYTHE, G.E., AND MOLER, C B Computer Solutton of Linear Algebraic Systems Prentice- Hall, Englewood Cliffs, N J, 1967

 

7




Peer to Peer - Readers of this Article have also read:


ER -